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EDITORIAL 


Chinese scientists and security 


any scientific and economic successes in 
the United States are owed to the work of 
foreign-born scientists. Today, a substan- 
tial percentage of U.S patents and scientific 
publications are contributed by Chinese- 
born scientists. But there is increasing con- 
cern that China is unfairly and systemati- 
cally attempting to capture intellectual property, trade 
secrets, and advanced technologies from the United 
States, all detrimental to productive long-term rela- 
tionships. As part of the U.S. response to these threats, 
tenured Chinese-American scientists at top institu- 
tions of biomedical research were recently dismissed 
for purported violations of disclosure rules, breaches 
of confidentiality, or outright suspicion of espionage. 
As Congress seeks to protect the 
open and collaborative approach to 
science and technology from nefar- 
ious activities of foreign states, the 
scientific community must weigh 
in on the path forward. 

We should remember that for 
years, scientific exchanges and 
collaborations with China were 
encouraged by U.S. policy-makers, 
including implicit support of Chi- 
nas Thousand Talents Program. 
Chinese-born as well as American-born federally 
funded scientists were publicly offered various posi- 
tions in China over the years without opposition by 
relevant institutions. The “rules,” now presented and 
enforced as severe violations of U.S. ethics and intel- 
lectual property regulations, were not rigorously im- 
plemented by officials at many U.S. institutions. The 
consternation, sense of targeted discrimination, and 
fear in the Chinese-American scientific community are 
thus understandable. 

U.S. science and technology are a cornerstone of 
competitiveness and preeminence in the world, and 
the security and protection of this vital enterprise 
are of paramount importance. But unlike most other 
countries, the United States relies heavily on attract- 
ing the best and brightest in the world to its ecosys- 
tem of innovation because of an insufficient number 
of graduates in science, technology, engineering, and 
mathematics fields relative to the size and technologi- 
cal intensity of its economy. Would it be in the national 
interest to risk losing all or some of the extraordinarily 


“athe United States 
relies heavily on 
attracting the best 
and brightest...” 


productive Chinese-American scientific diaspora 
trained and supported for years in the United States? 

In my experience in academia, government, and in- 
dustry, most Chinese-born scientists prefer to stay in 
the United States for personal and family preferences. 
One can imagine the glee of Chinese officials at the 
prospect of thousands of Chinese-American scientists, 
whom they were unable to recruit until now, relocat- 
ing back to China. So, what should be done? 

Congress is conducting hearings and considering 
passing the Securing American Science and Technol- 
ogy Act of 2019 as part of the Defense Authorization 
bill—a good step forward. Although this bill is pro- 
gressing at its usual pace in the legislative branch, 
time is not on our side. The executive branch should 
promptly establish a blue-ribbon 
panel composed of security experts, 
agencies representatives, academ- 
ics, and prominent scientists and 
engineers under the auspices of the 
Office of Science and Technology 
Policy and the National Security 
Council to fully debate the facts, 
ways, and means to effectively pro- 
tect U.S. science and technology 
while preserving the attractiveness 
of the United States to talented 
foreign-born scientists, most particularly from China. 
It should clearly define new rules of foreign scientific 
engagement; develop a risk matrix by area of science 
and technology to guide institutions; instruct agencies 
such as the Department of Commerce in the imple- 
mentation of all rules of export or import controls and 
material and information transfers in a manner simi- 
lar to that for dual-use research guidelines; mandate a 
compliance and education program for all concerned; 
and find explicit ways to reassure the Chinese-Ameri- 
can scientific community at large. 

No members of any community should be targeted 
because of their origins, rather than for what they may 
have intentionally and demonstrably done to harm 
U.S. science and technology. The United States should 
not risk losing critical intellectual assets such as pro- 
ductive foreign-born scientists and engineers to global 
competitors to serve short-term security concerns at 
the expense of long-term national interests. 


-Elias Zerhouni 
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ive adventurers plan to paddle a hand-hewn log boat 

200 kilometers from Taiwan, China, to the western- 

most island of Okinawa, Japan, to show that Paleolithic 

voyagers could have made similar trips in migrating 

to the remote Pacific isles more than 30,000 years ago. 

Yousuke Kaifu, an archaeologist at Japan’s National 
Museum of Nature and Science in Tokyo, dreamed up the 
project to solve the mystery of how early humans populated 
the Ryukyu Islands, in present-day Okinawa. Previous trials 
in the past 3 years using reed boats and bamboo rafts over 


Citizenship question blocked 


U.S. CENSUS | The U.S. Supreme Court 
last week told Commerce Secretary 
Wilbur Ross that he must do a better 
job of explaining his decision to add a 
citizenship question to the 2020 census. 
Writing the 5-4 opinion, Chief Justice 
John Roberts pointed to a “signifi- 

cant mismatch” between Ross’s official 
stance—that he was acquiescing to the 
Department of Justice’s request for better 
data to enforce the Voting Rights Act— 
and strong evidence that he had made 
up his mind months before he received 
such a request. The high court’s ruling 
says, in effect, that Ross failed to meet 
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the standards for justifying his actions 
under the Administrative Procedure Act 
(APA). “I’m pretty happy today,’ said John 
Thompson, who stepped down as director 
of the Census Bureau in June 2017 and 
has argued that adding the question could 
increase the cost and compromise the 
accuracy of the census. “Holding Secretary 
Ross accountable under APA ... preserves 
the nation’s statistical system.” 


LGBT survey shows discrimination 


WORKPLACE | Harassment and exclusion 
are common experiences for members 

of sexual and gender minorities in the 
physical sciences, according to a survey 


shorter distances were unsuccessful. But the dugout canoe, 
hewn from a 1-meter-thick tree trunk using a primitive stone 
ax, has proved to be more buoyant and faster moving than the 
previous crafts. The crew of five includes a Maori man who 
will navigate by traditional methods of following the stars and 
judging winds and ocean swells. As Science went to press, the 
team awaited good weather to launch the voyage, which could 
take 30 to 40 hours. Kaifu says the chance of success “is not 
very high.” Even if the attempt fails, the team expects to gain 
insights into the challenges of prehistoric ocean travel. 


released 26 June by the United Kingdom’s 
Institute of Physics, Royal Astronomical 
Society, and Royal Society of Chemistry. 
It included responses from 637 scientists 
working in the United Kingdom, most of 
whom identified as lesbian, gay, bisexual, 
transgender, or another sexual or gender 
minority. A climate of discrimination had 
prompted 28% of respondents to consider 
leaving their jobs. And 16% said they had 
experienced “exclusionary, intimidat- 

ing, offensive or harassing behaviour” in 
the past 12 months; among transgender 
people, that figure was 32%. The report 
highlights “areas for increased action” 
such as workplace training on transgender 
inclusion and correct pronoun usage. 
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Viruses blamed for pneumonia 


GLOBAL HEALTH | Pneumonia kills nearly 
1 million children every year, but the lung 
infection’s precise cause can be hard to 
pinpoint, and doctors often prescribe 
antibiotics without knowing that bacteria 
are to blame. A new study has found that 
61% of childhood pneumonia worldwide 
is caused by viruses. In 4252 pneumonia 


Sa 


Te ad virus aaa causes nearly one- 
third of all pneumonia cases in children under age 5. 
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cases among children in Africa and Asia, 
researchers used nose and throat swabs 
and a variety of fluid samples to infer the 
cause of infection from a list of 30 patho- 
gens. Viruses were the dominant culprits, 


the team reported last week in The Lancet, 


and one in particular—respiratory syncy- 
tial virus—accounted for nearly one-third 
of cases. The finding reveals the dramatic 
impact that sorely needed vaccines would 
have on the disease, the researchers say. 
Their new diagnostic approach might also 
help reduce the overuse of antibiotics. 


Contrails threaten climate 


CLIMATE SCIENCE | Carbon emissions 
from aviation are a well-known con- 
tributor to climate warming. But another 
byproduct of planes—the white contrails 
they paint across the sky—has an even 
bigger effect. Contrails form when water 
vapor in a plane’s exhaust freezes, and 
unlike low-level clouds that reflect sun- 
light to cool the atmosphere, they trap 
heat radiated from Earth’s surface, warm- 
ing it. Anew model described last week in 
Atmospheric Chemistry and Physics is the 
first to classify contrail clouds separately 
from natural ones. Researchers predicted 
a threefold increase in contrail warming 
effects by 2050. They also estimated that 
a 50% reduction in airplane soot, which 
promotes the formation of contrails, 
would lead to a 15% decrease in those 
effects. But such reductions are unlikely; 
global air traffic is surging, and most 
pollution-reduction plans in the industry 
fail to consider climate impact beyond 
carbon dioxide emissions. 


Pain society to shut down 


PROFESSIONAL GROUPS | Under finan- 
cial pressure from opioid lawsuits, the 
American Pain Society (APS) in Glenview, 
Illinois, announced last week that it 
would cease operations after its members 
overwhelmingly voted in favor of filing 
for bankruptcy. The 42-year-old organi- 
zation has hosted an annual meeting, 
awarded small grants for basic research 
and early career scientists, and published 
The Journal of Pain. It has recently been 
among the defendants in opioid-related 
lawsuits and faced allegations it had 
represented drugmakers’ interests and 
contributed to the overprescribing of the 


painkillers that led to the opioid epidemic. 


In a statement, APS described those suits 
as “spurious” and cited legal expenses 

as part of a “perfect storm,” along with 
declining sponsorship, membership, and 
meeting attendance. 
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BY THE NUMBERS 


million 
People still at risk of the bacterial 
illness trachoma, the leading 
infectious cause of blindness—down 
91% since 2002 thanks to the 
distribution of antibiotics and other 
public health campaigns (World 
Health Organization). 


3% 


Proportion of U.S. electricity 
generated by renewable sources 
as of April, the first month 
in which renewables surpassed 
coal’s contribution—estimated 
at 20% (U.S. Energy Information 
Administration). 


Bottlenose dolphins estimated to 
have stranded and died in the past 
year because of a harmful algal 
bloom along the southwest coast 
of Florida (U.S. National Oceanic 
and Atmospheric Administration). 


Zambian river will keep flowing 


EcoLoay | After an outcry from conser- 
vation groups, the Zambian government 
has withdrawn approval for a dam on the 
1100-kilometer Luangwa River, one of the 
longest free-flowing rivers left in southern 
Africa. The World Wildlife Fund (WWF) 
and other groups have opposed plans by 

a South African power company to create 

a 1510-square-kilometer reservoir, which 
researchers have determined would flood 
out parts of several tribal, hunting, and 
protected areas and shrink a wildlife cor- 
ridor between two national parks. WWF 
freshwater scientist Michele Thieme called 
the Luangwa decision “a major win.” Recent 
public opposition in Chile has killed similar 
projects on the Baker and Pascua rivers, two 
of the country’s wildest waterways. 
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SCIENCE POLICY 


FDA enforcement actions 
plummet under Trump 


Agency is sending many fewer warning letters 


By Charles Piller 


rom monitoring clinical trials and ap- 

proving medicines and vaccines, to en- 

suring the safety of blood transfusions, 

medical devices, groceries, and more, 

the U.S. Food and Drug Administration 

(FDA) is one of the nation’s most vital 
watchdogs. By several measures, however, 
FDAs compliance and enforcement actions 
have plummeted since President Donald 
Trump took office, Science has found. 

The agency’s “warning letters’—a key 
tool for keeping dangerous or ineffective 
drugs and devices and tainted foods off the 
market—have fallen by one-third, for ex- 
ample. Such letters typically demand swift 
corrections to protect public health and 
safety. FDA records from Trump’s inaugura- 
tion through 22 May show the agency issued 
1033 warning letters, compared with 1532 
for the most recent equivalent period under 
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former President Barack Obama. Compared 
with the start of the Obama presidency, 
Trump-era letters dropped by nearly half. 
Warnings from the FDA Center for De- 
vices and Radiological Health, which helps 
ensure the safety and quality of medical de- 
vices, and from some of the agency’s district 
offices—including Philadelphia, Florida, and 
New York—have dropped even more steeply, 
by more than two-thirds. Two district of- 
fices have not issued a warning in more than 
2 years. The numbers don’t just reflect a new 
administration’s slow start. FDA sent signifi- 
cantly fewer warning letters in the second 
year of Trump’s presidency than in his first. 
FDA watchers say they can’t pinpoint 
what’s driving the decline, but they are 
alarmed. “Those who think the Trump ad- 
ministration has not succeeded in its deregu- 
latory efforts ought to look at these data,” says 
Peter Lurie, an FDA executive under Obama 
and Trump and now executive director of the 
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Center for Science in the Public Interest, a 
Washington, D.C., advocacy group. “Industry 
may well take the message from this that the 
cop is not on the beat as often.” 

Several other FDA actions under Trump 
show similar declines when measured 
against the end of the Obama administration. 
FDA inspection reports labeled “official ac- 
tion indicated”—typically a trigger for warn- 
ing letters or similar actions—have fallen by 
about half under Trump and are continuing 
to trend downward. Even FDA’s rare injunc- 
tions, a more forceful step than warnings 
to prevent sales or distribution of unsafe or 
otherwise illegal products, fell from 35 in the 
last part of the Obama administration to 26 
under Trump. (During a comparable period 
at the start of the Obama years, FDA issued 
51 injunctions.) The agency’s “untitled 
letters”’—for concerns that fall short of thresh- 
olds for formal warnings—also have dropped 
sharply under Trump. 

“FDA's power to enforce its requirements 
is an important part of how it achieves its 
public health mission,” says Patricia Zettler, 
a law professor at Ohio State University in 
Columbus and former FDA attorney. “If FDA 
is not using that power, it sends a signal that 
violations will be tolerated.” 

FDA did not dispute the enforcement and 
compliance data, which Science compiled 
from the agency’s own public records. In a 
written statement, the agency responded, 
“Sometimes the actions we take are visible, 
like warning letters (or) recalls. ... At other 
times, our actions to protect consumers are 
less discernible, but equally vital.” Among the 
efforts that can obviate the need for warn- 
ing letters, FDA noted, are meetings with 
companies, follow-up inspections, the threat 
of mandatory recalls, a new voluntary im- 
provement pilot program for devicemakers, 
coordination with European regulators, and 
unspecified “regulatory and compliance mea- 
sures” conducted “behind the scenes.” 

Scott Gottlieb, Trump’s first FDA com- 
missioner, defended his record after review- 
ing a summary of Science’s data. “We were 
pretty aggressive,” he wrote in an email. “I 
don’t think you can paint us with a political 
narrative—that just because we were a Re- 
publican administration, somehow we must 
have ratcheted down enforcement activity. 
We didn’t.” 

Gottlieb, a physician and venture capital- 
ist before being tagged for FDA, resigned in 
March and is now at the American Enterprise 
Institute, a conservative think tank in Wash- 
ington, D.C. On his watch, Gottlieb said later 
in an interview, FDA made enforcement more 
efficient and expanded field resources. The 
agency's budget rose 21% from fiscal years 
2016 to 2019. He says that under Trump, FDA 
issued guidance or beefed-up enforcement 
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on stem cells, vaping, dietary supplements, 
homeopathy, opioids, and generic drugs. He 
argues that warning letters and other such 
actions are imperfect enforcement measures. 

The only significant exception to the down- 
ward trend on warning letters is FDA’s Center 
for Drug Evaluation and Research (CDER), 
which assesses and approves new drugs and 
polices existing drugs for safety and efficacy. 
It has issued 188 warnings under Trump, 
compared with 116 in the most recent period 
under Obama. Many of the Trump-period let- 
ters concern opioid sales, or over-the-counter 
and generic drugs made in China and India, 
which have rapidly gained market share 
despite quality lapses documented by FDA 
and others. 

Besides regulating consumer products, 
FDA protects subjects in clinical trials by re- 
viewing informed consent and other ethical 
practices. In the most recent period under 
Obama, CDER issued 19 warnings concern- 
ing improper conduct of clinical investiga- 
tions and protection of human subjects; 
under Trump the number fell to seven. 

Michael Carome, who directs health re- 
search at the Washington, D.C., consumer 
advocate Public Citizen, points to clinical 
trials at Hennepin County Medical Center 
in Minneapolis, Minnesota, as an example of 
FDA’ss sluggish approach to human subjects 
protections. In July 2018, Public Citizen and 
more than 60 clinical scholars and ethicists 
complained to FDA about apparent lapses 
in informed consent and possible coercion 
of unwitting and vulnerable patients treated 
with powerful experimental drugs, including 
ketamine. The anesthetic and pain reliever 
was being tested to control agitation. 

The next month, an FDA inspection of the 
trials validated some of Public Citizen’s con- 
cerns. After a second inspection, FDA cited 
other serious transgressions. As first reported 
by Minneapolis’s Star Tribune, these included 
failing to report to an oversight body that 
some patients developed serious breathing 
problems and movement disorders. 

In May, the medical center’s operator dis- 
puted most FDA findings, but agreed to make 
a few procedural changes. Nearly a year after 
the original complaint, FDA has yet to issue 
a warning letter. The agency declined to com- 
ment about the case. “It was among the more 
serious things I’ve seen in human subjects 
protection violations,’ says Carome, previ- 
ously a Department of Health and Human 
Services official tasked with protecting hu- 
man subjects. “Maybe [the FDA is] not taking 
appropriate action in some of these cases.” 

Even electronic cigarette enforcement 
has seen mixed results under Trump, de- 
spite Gottlieb’s public stand against teen 
vaping. On the one hand, a special pro- 
gram to police retail sellers of tobacco 
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and vaping products—such as convenience 
and grocery stores—issued about as many 
warnings and fines under Trump as at the 
end of the Obama years. Gottlieb says FDA 
put managers of large retail chains on no- 
tice to improve compliance. He also imple- 
mented a rule finalized under Obama that 


Watchdog off duty? 


FDA enforcement actions are dropping under Presi- 
dent Donald Trump. The number of warning letters 
and injunctions (through May 22) and Official Action 
Indicated (OA!) reports (through April 22) fell steeply 
from the equivalent periods before his inauguration. 
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Centers of inaction 
The decline in warning letters spans most FDA divisions. 
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A tale of two presidents 
Over a decade, warning letters and OAI reports rose and 
then waned, before a large Trump-era falloff. 
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gave the e-cigarette industry a road map to 
comply with federal tobacco laws. 

But warnings from the FDA Center for To- 
bacco Products, whose targets include whole- 
salers, online vendors, makers and suppliers 
of vaping liquids, and importers, dropped by 
one-third. Gottlieb’s FDA also delayed from 
2018 to 2022 an Obama-era deadline requir- 
ing e-cigarette-makers to submit products 
for premarketing review, reasoning that this 
would give the industry time to develop in 
ways that help adult smokers quit. (FDA re- 
cently advanced the deadline to 2021.) 

Federal data show vaping among high 
school students shot up 78% from 2017 to 
2018, and 48% among middle school stu- 
dents. Last year, FDA said it would restrict 
some sweet vaping flavors that entice chil- 
dren and began to enforce nicotine warn- 
ings on such products and crack down on 
improper marketing. But given the teen 
vaping epidemic, some critics view such 
moves as too little, too late. “Had we known 
there was going to be an epidemic of youth 
use ... we would have made different deci- 
sions” and moved more quickly, Gottlieb 
concedes. He says that delaying the review 
deadline helped to ensure adequate guid- 
ance for industry, and was not responsible 
for the youth crisis. 

Deregulatory pressure from the White 
House has had profound effects on several 
federal agencies, but whether and how it 
might explain the drop in some elements of 
FDA enforcement is unclear. The agency has 
few political appointees and is relatively de- 
centralized. Some insiders say Gottlieb gener- 
ally supported vigorous enforcement. But for 
some actions, including injunctions, FDA re- 
quires Department of Justice (DOJ) coopera- 
tion. Two people with personal knowledge of 
FDA enforcement, who decline to be named 
for fear of reprisals, say that under Trump, 
DOJ has blocked numerous strong injunc- 
tion requests from FDA. One source says it 
became much harder to bring new injunction 
cases after Trump took office. 

In its statement to Sczence, FDA did not ad- 
dress the allegations, but called DOJ a “strong 
partner.” DOJ did not respond to queries. 

Joshua Sharfstein, a health policy re- 
searcher at Johns Hopkins University in 
Baltimore, Maryland, who worked at FDA 
under Obama, including as its acting com- 
missioner, says he hopes the data found by 
Science will prompt a closer look at the ap- 
parent declines in enforcement, and help 
clarify what has caused them. “The FDA’s 
enforcement role is extremely important,’ 
he says. “If the FDA isn’t doing enforce- 
ment, nobody’s doing enforcement." 


This story has been supported by the 
Science Fund for Investigative Reporting. 
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Details revealed on NIH probe of foreign ties 


Official overseeing inquiry reveals undisclosed firings and returned funds from universities 


By Jeffrey Mervis 


yearlong investigation by the National 

Institutes of Health (NIH) in Bethesda, 

Maryland, has fingered 180 scientists 

at more than 60 U.S. research institu- 

tions who NIH believes have violated 

the confidentiality of its peer-review 
system or failed to disclose financial ties to 
foreign organizations. But NIH has revealed 
few details of the investigation—including 
how it began and what it has learned. 

Until now. Last week, the senior NIH of- 
ficial leading the agency's review of foreign 
ties talked with Science about the probe’s 
substantial, and growing, scope. 

This spring, two institutions—MD Ander- 
son Cancer Center in Houston, Texas, and 
Emory University in Atlanta—announced 
that five faculty members had been dismissed 
in the wake of the NIH probe. But Michael 
Lauer, who directs NIH’s $20 billion-plus 
extramural research program, says sev- 
eral other universities have cut ties with 
researchers in cases that have remained 
confidential. Lauer says some institutions 
have repaid NIH “hundreds of thousands of 
dollars” in grants after confirming what he 
termed “egregious” violations. 

Lauer understands why some universi- 
ties have kept quiet about those outcomes. 
“No organization wants to discuss person- 
nel actions in a public forum,” he says. At 
the same time, Lauer says NIH is, “to the 
extent possible, engaged in outreach” to 
the scientific community about the impor- 
tance of ensuring research integrity and 
defending against foreign influence. 

U.S. officials have become increasingly 
worried that foreign entities, especially 
those in China, are trying to manipulate 
the traditionally open U.S. research en- 
terprise to their advantage. Their fears in- 
clude the theft of intellectual property as 
well as economic and military espionage. 

Lauer suspects some of the violations 
NIH has uncovered will result in the U.S. 
government banning scientists from receiv- 
ing federal funds, a process called debar- 
ment. NIH has referred at least 18 cases to 
its parent agency, the Department of Health 
and Human Services, for further investiga- 
tion. “We don’t know where we'll end up,” 
Lauer says. But in the meantime, he adds, “I 
wouldn’t be surprised if other agencies fol- 
low our lead and start doing similar things.” 
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Michael Lauer of the National Institutes of Health 
says undisclosed foreign ties had gone on for years 
before the agency noticed them. 


Science could not independently verify 
some of Lauer’s comments because of the 
secrecy surrounding NIH’s investigation. 
But here are highlights of the conversation: 


¢ NIH’s current effort began after it 
learned in mid-2016 of a Federal Bureau 
of Investigation (FBI) case involving an 
MD Anderson researcher serving as an 
NIH reviewer. The FBI probe revealed 
that, despite strict rules to protect con- 
fidentiality, the researcher had shared 
grant proposals with several people. 
NIH has since found similar violations 
by other reviewers. 


¢ Most disclosure violations came to 
light through NIH program staff por- 
ing over annual progress reports from 
grantees. They noticed that scores of 
U.S. researchers had reported affili- 
ations with or funding from foreign 
institutions—particularly in China—in 
their research papers but failed to tell 
their home institutions or NIH about 
those arrangements. 


¢ A majority of the scientists who have 
failed to disclose foreign ties hold 
multiple grants from NTH. “Most [of 
the scientists] are ethnically Chinese,” 
Lauer says, “although some of our 
more serious cases are not.” The 
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researchers conduct studies in many 
fields, he adds. 


¢ Many U.S. universities initially pushed 
back after receiving a letter, telling 
NIH that the named scientists had no 
such connections and that NIH “was 
blowing smoke.” But those institutions 
later reversed course, Lauer says, after 
NIH supplied evidence such as grant 
numbers from foreign funders and 
employment contracts with foreign 
institutions. University officials were 
“surprised, shocked, and horrified” by 
the undisclosed ties, Lauer says. 


¢ Some scientists targeted by the probe 
had NIH grants that provided salary 
for 8 months of work each year at 
their home institution. But investiga- 
tions showed they had also agreed to 
work as much as 9 months a year at 
a foreign institution. A 17-month time 
commitment amounts to fraud, Lauer 
says. As a result, some institutions 
have returned salary funds to NIH. 


¢ Some U.S.-based scientists partici- 
pating in China’s Thousand Talents 
Program, which seeks to build ties 
with scientists working outside China, 
signed contracts that stipulated their 
research results “must stay in China 
and cannot be reported to their Ameri- 
can university.” Others had contracts 
that required them to promote their 
Chinese affiliations in any publication. 


“It’s fascinating,” Lauer says about the dual 
laboratories, overcommitments of time, 
and secret contracts by grantees that NIH 
says it has uncovered. The failure to report 
such arrangements “had been going on for 
a number of years, apparently. But it took 
a long time before we noticed it.” 

Many members of the Asian American 
community believe that U.S. agencies have 
been targeting Asian-born scientists simply 
for their participation in Thousand Talents 
and similar Chinese recruitment programs. 
But Lauer says NIH has no problem with 
US. scientists participating in Thousand 
Talents as long as they fully disclose that 
relationship. “Thousand Talents is not a 
threat [to the United States],” he says. “It’s 
not the specific conduct we are focusing 
on, it’s the failure to disclose it.” & 
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Low gravity and 
thick air will help 
the 300-kilogram 
Dragonfly hover. 


NASA to fly drone on Titan 


Dragonfly quadcopter will scout saturnian moon for life 


By Paul Voosen 


n Earth's moon and Mars, driving is 

the best option for a robot explorer. 

But on Titan, Ralph Lorenz plans 

to fly. Telescopes indicated Saturn’s 

largest moon had a thick, hazy at- 

mosphere, sodden with methane. In 
such an environment, the best way to get 
around, Lorenz floated in a 2000 New Sci- 
entist article, would be a helicopter. 

Two decades later, Lorenz’s idea has turned 
into Dragonfly, a $1 billion mission selected 
last week by NASA for launch in 2026. Upon 
arrival in 2034, the craft, a quadcopter drone 
the size of a car, will take flight, periodically 
touching down on the icy surface in search of 
a chemistry that could foster life. “It doesn’t 
get more interesting than a nuclear-powered 
quadcopter with drills and a neutron gun,” 
says Lorenz, Dragonfly’s project scientist at 
the Johns Hopkins University Applied Phys- 
ics Laboratory (APL) in Laurel, Maryland, 
which will manage the mission. 

NASA’s pick is inspired, says Lindy 
Elkins-Tanton, a planetary scientist at Ari- 
zona State University in Tempe. “Titan might 
truly be the cradle for some kind of life,’ she 
says. “Whether life has emerged or not, Ti- 
tan’s hydrocarbon rivers and lakes, and its 
hydrocarbon snow, make it one of the most 
fantasylike landscapes in our solar system.” 

Titan was first explored by NASA’s Cas- 
sini spacecraft. In 2005, it dropped the 
short-lived European Huygens probe into 
Titan’s atmosphere. The surface it found 
was Earthlike, with plateaus, dune-filled 
deserts, and liquid seas at its poles. But on 
Titan, where temperatures average a frigid 
94 K (-179° C), the “rocks” are made of water 
ice and the seas are filled with the hydro- 
carbons ethane and methane. Scientists 
believe the stew of organic molecules may 
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have reacted to form amino acids and the 
bases that, on Earth, make up DNA’s double 
helix. It’s as if Titan has been conducting 
experiments on the origins of life for mil- 
lions of years, says Elizabeth “Zibi” Turtle, 
a planetary scientist at APL and the mis- 
sion’s principal investigator. “Dragonfly is 
designed to go pick up the results of those 
experiments and study them,’ she says. 

Given Titan’s complex surface, a visit to a 
single site would not say much, so Dragon- 
fly will hop between distant landing sites 
every couple of weeks. Titan’s thick air and 
low gravity will allow the 300-kilogram cop- 
ter, powered by a radioactive generator, to 
hover much more easily than on Earth. 

The probe will target the moon’s vast 
equatorial deserts, which likely contain a 
grab bag of material. It will search for im- 
pact craters or ice volcanoes, places that 
could have provided an energetic spark— 
and the liquid water—needed for organic 
reactions. Ultimately, it will reach the 
80-kilometer-wide Selk crater, created by an 
impact large enough to melt Titan’s water- 
ice crust and liberate oxygen, priming reac- 
tions that may be recorded in its outcrops. 

During flight, Dragonfly can collect at- 
mospheric measurements. Upon landing, 
an instrument on the copter’s belly will 
fire neutrons at the surface and look for re- 
leased gamma rays to detect basic terrain 
types, such as ammonia-rich ice or carbon- 
rich sand dunes. Its two landing skids will 
each carry a drill to take samples and feed 
them to an instrument to analyze their 
composition. The craft will also be able to 
sense vibrations induced by the tidal forces 
of Saturn. Those tremors could help gauge 
the depth and composition of the ocean 
thought to lie beneath the moon’s crust. 

Dragonfly may last up to 8 years before 
its radioactive power source peters out. © 
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CRISPR patent 
fight revived 


U.S. patent office issues new 
interference declaration 


By Jon Cohen 


surprise ruling last week reignited 
the high-profile patent fight over 
who invented a key application of 
the genome editor CRISPR. The 
3-year-old battle pits parties rep- 
resented by the University of Cali- 
fornia (UC) against the Broad Institute 
in Cambridge, Massachusetts. It revolves 
around the use of CRISPR, originally de- 
rived from a DNA-cutting system used 
by bacteria, in the more complex cells of 
eukaryotes—including humans, which 
means the outcome could shape the devel- 
opment of potentially lucrative CRISPR- 
based medical treatments. 

Triggered by new patent claims from the 
UC group, the development resurrects con- 
tentious debates that calmed down after a 
court decision in September 2018 favoring 
the Broad Institute. The UC team, which 
represents Jennifer Doudna from UC 
Berkeley and her collaborators, had first 
published the use of CRISPR as a genome- 
editing tool in Science in June 2012, but 
they did not demonstrate that it worked 
in eukaryotes. 

In January 2013, Broad’s team, led by 
Feng Zhang, published a paper—also in 
Science—that described its success in mouse 
and human cells. After the U.S. Patent and 
Trademark Office awarded Broad several 
patents, UC requested what’s known as an 
interference, contesting Broad’s patents 
based on its own submitted patent. UC’s 
attorneys argued that after the Doudna 
group’s 2012 paper, it was “obvious” that 
CRISPR would work in eukaryotes. 

The Patent Trial and Appeal Board 
(PTAB) ruled against UC in February 2017, 
and the U.S. Court of Appeals for the Fed- 
eral Circuit denied UC’s appeal 1 year later. 
In April 2018, however, UC filed new pat- 
ent claims that highlighted its invention’s 
utility in eukaryotic cells, prompting PTAB 
to rule that the two parties’ claims now cre- 
ate an interference issue that needs to be 
examined in a hearing. 

Eldora Ellison, a lead attorney for the 
UC team—who works at Sterne, Kessler, 
Goldstein & Fox in Washington, D.C., says 
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Low gravity and 
thick air will help 
the 300-kilogram 
Dragonfly hover. 


NASA to fly drone on Titan 


Dragonfly quadcopter will scout saturnian moon for life 


By Paul Voosen 


n Earth's moon and Mars, driving is 

the best option for a robot explorer. 

But on Titan, Ralph Lorenz plans 

to fly. Telescopes indicated Saturn’s 

largest moon had a thick, hazy at- 

mosphere, sodden with methane. In 
such an environment, the best way to get 
around, Lorenz floated in a 2000 New Sci- 
entist article, would be a helicopter. 

Two decades later, Lorenz’s idea has turned 
into Dragonfly, a $1 billion mission selected 
last week by NASA for launch in 2026. Upon 
arrival in 2034, the craft, a quadcopter drone 
the size of a car, will take flight, periodically 
touching down on the icy surface in search of 
a chemistry that could foster life. “It doesn’t 
get more interesting than a nuclear-powered 
quadcopter with drills and a neutron gun,” 
says Lorenz, Dragonfly’s project scientist at 
the Johns Hopkins University Applied Phys- 
ics Laboratory (APL) in Laurel, Maryland, 
which will manage the mission. 

NASA’s pick is inspired, says Lindy 
Elkins-Tanton, a planetary scientist at Ari- 
zona State University in Tempe. “Titan might 
truly be the cradle for some kind of life,’ she 
says. “Whether life has emerged or not, Ti- 
tan’s hydrocarbon rivers and lakes, and its 
hydrocarbon snow, make it one of the most 
fantasylike landscapes in our solar system.” 

Titan was first explored by NASA’s Cas- 
sini spacecraft. In 2005, it dropped the 
short-lived European Huygens probe into 
Titan’s atmosphere. The surface it found 
was Earthlike, with plateaus, dune-filled 
deserts, and liquid seas at its poles. But on 
Titan, where temperatures average a frigid 
94 K (-179° C), the “rocks” are made of water 
ice and the seas are filled with the hydro- 
carbons ethane and methane. Scientists 
believe the stew of organic molecules may 
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have reacted to form amino acids and the 
bases that, on Earth, make up DNA’s double 
helix. It’s as if Titan has been conducting 
experiments on the origins of life for mil- 
lions of years, says Elizabeth “Zibi” Turtle, 
a planetary scientist at APL and the mis- 
sion’s principal investigator. “Dragonfly is 
designed to go pick up the results of those 
experiments and study them,’ she says. 

Given Titan’s complex surface, a visit to a 
single site would not say much, so Dragon- 
fly will hop between distant landing sites 
every couple of weeks. Titan’s thick air and 
low gravity will allow the 300-kilogram cop- 
ter, powered by a radioactive generator, to 
hover much more easily than on Earth. 

The probe will target the moon’s vast 
equatorial deserts, which likely contain a 
grab bag of material. It will search for im- 
pact craters or ice volcanoes, places that 
could have provided an energetic spark— 
and the liquid water—needed for organic 
reactions. Ultimately, it will reach the 
80-kilometer-wide Selk crater, created by an 
impact large enough to melt Titan’s water- 
ice crust and liberate oxygen, priming reac- 
tions that may be recorded in its outcrops. 

During flight, Dragonfly can collect at- 
mospheric measurements. Upon landing, 
an instrument on the copter’s belly will 
fire neutrons at the surface and look for re- 
leased gamma rays to detect basic terrain 
types, such as ammonia-rich ice or carbon- 
rich sand dunes. Its two landing skids will 
each carry a drill to take samples and feed 
them to an instrument to analyze their 
composition. The craft will also be able to 
sense vibrations induced by the tidal forces 
of Saturn. Those tremors could help gauge 
the depth and composition of the ocean 
thought to lie beneath the moon’s crust. 

Dragonfly may last up to 8 years before 
its radioactive power source peters out. © 
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CRISPR patent 
fight revived 


U.S. patent office issues new 
interference declaration 


By Jon Cohen 


surprise ruling last week reignited 
the high-profile patent fight over 
who invented a key application of 
the genome editor CRISPR. The 
3-year-old battle pits parties rep- 
resented by the University of Cali- 
fornia (UC) against the Broad Institute 
in Cambridge, Massachusetts. It revolves 
around the use of CRISPR, originally de- 
rived from a DNA-cutting system used 
by bacteria, in the more complex cells of 
eukaryotes—including humans, which 
means the outcome could shape the devel- 
opment of potentially lucrative CRISPR- 
based medical treatments. 

Triggered by new patent claims from the 
UC group, the development resurrects con- 
tentious debates that calmed down after a 
court decision in September 2018 favoring 
the Broad Institute. The UC team, which 
represents Jennifer Doudna from UC 
Berkeley and her collaborators, had first 
published the use of CRISPR as a genome- 
editing tool in Science in June 2012, but 
they did not demonstrate that it worked 
in eukaryotes. 

In January 2013, Broad’s team, led by 
Feng Zhang, published a paper—also in 
Science—that described its success in mouse 
and human cells. After the U.S. Patent and 
Trademark Office awarded Broad several 
patents, UC requested what’s known as an 
interference, contesting Broad’s patents 
based on its own submitted patent. UC’s 
attorneys argued that after the Doudna 
group’s 2012 paper, it was “obvious” that 
CRISPR would work in eukaryotes. 

The Patent Trial and Appeal Board 
(PTAB) ruled against UC in February 2017, 
and the U.S. Court of Appeals for the Fed- 
eral Circuit denied UC’s appeal 1 year later. 
In April 2018, however, UC filed new pat- 
ent claims that highlighted its invention’s 
utility in eukaryotic cells, prompting PTAB 
to rule that the two parties’ claims now cre- 
ate an interference issue that needs to be 
examined in a hearing. 

Eldora Ellison, a lead attorney for the 
UC team—who works at Sterne, Kessler, 
Goldstein & Fox in Washington, D.C., says 


5 JULY 2019 + VOL 365 ISSUE 6448 15 


NEWS | IN DEPTH 


PTAB ruled in 2017 that there was no inter- 
ference because Broad focused on eukary- 
otes and the UC patent made claims about 
the utility of CRISPR in “any environ- 
ment.” “What they said is, ‘We’re actually 
not going to have a fight at this point in 
time, because we think that these are two 
different inventions,” Ellison says. “They 
kind of kicked the can down the road on 
who was first to invent the use of CRISPR 
in eukaryotes.” 

Now that UC has sharpened its claim 
to the use of CRISPR in eukaryotes, Broad 
“(looks] forward to participating in the 
interference process,” according to a state- 
ment from the institute. Broad argues the 
new interference “challenges the validity 
of [UC’s] eukaryotic claims.” The ruling 
deems the Broad group the “senior party,” 
which means UC “carries the burden of 
proof” and must convince PTAB that the 
Broad team did not invent the eukaryotic 
use of CRISPR. 

Catherine Coombes, a patent attorney 
at HGF in York, U.K., who does not rep- 
resent Broad or UC but is involved in 
CRISPR patent disputes, says the new in- 
terference “adds to the complexity of the 
landscape.” The European Patent Office 
has granted what’s known as “overlapping 
rights” to both groups and others who 
have filed CRISPR patent applications, 


“For human therapeutics, 
it is still too early 
to know where the key 


patents will lie.” 
Catherine Coombes, HGF 


Coombes says—a temporary resolution 
in anticipation of future legal fights. “For 
human therapeutics, it is still too early to 
know where the key patents will lie,” she 
adds, noting that CRISPR systems vary, us- 
ing different enzymes that ultimately may 
make one version safer or more effective 
than another. 

The patent uncertainty created by the 
new interference declaration may prod 
UC and Broad to cut a deal, Coombes says. 
Broad says in a separate statement that it 
has long hoped UC would enter a “patent 
pool,” which effectively would allow both 
parties to earn money from their inven- 
tions without becoming ensnarled in legal 
wrangling. 

The parties will discuss the interference 
with PTAB in a conference call on 5 Au- 
gust. If they do not reach a settlement, the 
interference hearing is expected in about 
8 months. 
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Epigenetics comes to RNA 


Chemical tags on RNA drive cahicer or enhance memory 


By Ken Garber, in Chicago, Illinois 


he idea that chemical tags on genes 
can affect their expression without 
altering the DNA sequence, once sur- 
prising, is the stuff of textbooks. The 
phenomenon, epigenetics, has now 
come to messenger RNA (mRNA), 
the molecule that carries genetic informa- 
tion from DNA to a cell’s proteinmaking 
factories. At a conference here last month, 
researchers discussed evidence that RNA 
epigenetics is also critical for gene expres- 
sion and disease, and they described a new 
chemical modification linked to leukemia. 
Research has found that epigenetic marks 
decorate mRNAs like Christmas lights on a 
fence. The cell uses the marks “to determine 
where, when, and how much of the [associ- 
ated] protein should be generated,’ RNA bio- 
logist Pedro Batista of the National Cancer 
Institute (NCI) in Bethesda, Maryland, said 
at the conference. What’s more, says Michael 
Kharas of Memorial Sloan Kettering Cancer 
Center in New York City, mRNA modifica- 
tions “can affect the viability of cells, whether 
cells divide, cancer, neurologic diseases.” 
They are providing promising leads for drug 
developers. And, he adds, “There’s so many 
[more] diseases these things could be impor- 
tant in, ones people aren’t even looking at.” 
Modified mRNAs had been reported in 
the 1970s, but by 2008 they were largely 
forgotten. Then, Chuan He at the Univer- 
sity of Chicago, Samie Jaffrey at Cornell 
University, and Gideon Rechavi at Tel Aviv 
University in Israel took a fresh look. Their 
teams focused on one mRNA modifica- 
tion called m°A: a methyl group—a simple 
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chemical unit—attached to some of an 
RNA molecule’s adenine bases. He’s group 
showed that a well-known enzyme removes 
this mRNA modification, indicating that 
m°A has an important biological role, and 
Jaffrey’s and Rechavi’s groups developed 
mapping tools that showed it is widespread. 
Before the work, researchers knew mRNA 
epigenetic marks were there, but “they just 
didn’t know how to actually look for them,” 
says NCI researcher Shalini Oberdoerffer. 

Of at least half a dozen modifications of 
mRNA, m°A is the best studied. When pro- 
teins called readers attach to it, they direct 
the fate of the marked mRNA—which can 
vary dramatically. 

For example, m°A boosts gene expression 
needed for embryonic stem cells to properly 
differentiate into different cell types. But in 
blood stem cells, m°A restricts differentia- 
tion. In leukemia—a disease of blood stem 
cells gone awry—m°A sustains disease by 
keeping the cells in a stemlike state. In 2017, 
three groups, including Kharas’s, indepen- 
dently showed that eliminating the enzyme 
that places m°A on mRNA kills tumor cells 
in acute myeloid leukemia. At least three 
biotech companies are now developing ex- 
perimental drugs to block such enzymes. 

At the meeting, Tony Kouzarides of the 
University of Cambridge in the United 
Kingdom reported a new mRNA modifica- 
tion and an associated enzyme that drives 
leukemia. “I suspect there will be many, 
many more” links to leukemia, he said. 

M®A has also turned out to be critical in 
the brain. Through its readers, it controls 
the precise timing of new neuron formation 
during development in mice and enables 
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An enzyme (pink) places a chemical mark (gold) on 
messenger RNA (blue), in an artist’s concept. 


axons to regenerate after nerve injury. The 
modification also enhances memory. When 
He’s team knocked out the gene for an m°A 
reader in mice, the otherwise normal ani- 
mals had memory defects. Injecting a virus 
carrying the normal reader gene reversed 
the effect. And when the researchers chemi- 
cally stimulated the neurons to mimic the 
addition of a new memory, they saw a burst 
of protein synthesis that depended on m°A, 
they reported last year in Nature. 

Several years ago, Oberdoerffer followed 
a hunch that cells might use another simple 
chemical unit, an acetyl group, on mRNA. 
Her team reported last year in Cell that 
many mRNA cytosine bases are acetylated. 
The change boosts translation by stabilizing 
the molecules, and perhaps also by helping 
mRNAs match up with the correct transfer 
RNAs (tRNAs), the small RNA molecules that 
read the mRNA and add an amino acid to 
a growing protein chain. When mRNA and 
tRNA complement each other, they bind, 
triggering the addition of the amino acid. 
But the system isn’t exact—there are many 
more possible mRNA sequences than there 
are tRNAs, so tRNAs must somehow find 
(and bind to) some mRNAs that don’t match. 

Oberdoerffer’s team found a clue to the 
mystery: an acetylated mRNA base of- 
ten sits where a tRNA must recognize the 
mRNA despite a mismatch. The RNA mod- 
ification’s presence dramatically boosts 
gene translation, the researchers found. 
Oberdoerffer doesn’t think the modifica- 
tion is necessary for correct mRNA-tRNA 
recognition, but it may strengthen bind- 
ing. “I think we will learn that the genetic 
code as we know it is not a static entity,” 
she says. 

Like other fledgling areas of research, 
RNA epigenetics (also known as epitran- 
scriptomics) has its skeptics. In 2016, one 
group reported in Nature it had found a new 
modification, m'A, at more than 7000 sites 
across a cell’s complement of mRNAs. But 
a year later in the same journal, another 
group claimed that at most 15 mRNA m'A 
sites exist. “Because of that, everyone in 
the molecular biology community is a little 
bit suspicious about the validity of these 
[mRNA] modifications,” Jaffrey says. 

Other disputes rage over the functions 
of key enzymes and reader proteins. But 
epitranscriptomics is evolving fast. “We just 
need ... a lot more knowledge about these 
things,’ He says. “We need to stay open 
minded. The field is still very young.” 


Ken Garber is a journalist in 
Ann Arbor, Michigan. 
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DNA reveals European roots 


of the ancient Philistines 


3000-year-old burials identify the enemies of the biblical 
Israelites and shore up legends of Bronze Age migration 


By Ann Gibbons 


s a schoolgirl in Israel, Michal Feld- 
man learned that the ancient Philis- 
tines, who lived between present-day 
Tel Aviv and Gaza during the Iron 
Age, were “the bad guys.” In the Bible, 
they were the archenemies of the Isra- 
elites, who fought Samson’s armies and sent 
Goliath into battle against David. “Philistine” 
is still a slur for an uncivilized barbarian. 
Now a Ph.D. student in Germany, Feld- 
man has found a new way to understand 
the Philistines. By analyzing DNA from 
12th century B.C.E burials in the Philis- 
tines’s renowned city of 
Ashkelon, her team has 
found that they were in- 
terlopers in the ancient 
Middle East. Their clos- 
est known kin were from 
southern Europe, the 
team reports this week 
in Science Advances. 
The DNA data suggest 
a kernel of truth to Greek 
and Middle Eastern leg- 
ends that describe sur- 
vivors who moved south 
after the catastrophic 
collapse of great Bronze 
Age civilizations of the 
Mediterranean in the 
late 13th and early 12th 
centuries B.C.E. “This [DNA] story of migra- 
tion is tantalizingly close to those memories,” 
says co-author Daniel Master of Wheaton 
College in Illinois, who leads excavations in 
Ashkelon, Israel. “This is about real people 
who are moving from real troubles, finding 
new families in a new home,’ adds Assaf 
Yasur-Landau, an archaeologist at the Uni- 
versity of Haifa in Israel who was not part of 
the study. “It’s the most basic human story.” 
Archaeologists have known for a century 
that the distinctive ceramic pots and other 
artifacts that suddenly appeared in 12th 
century B.C.E. Philistine cities resemble 
artifacts from the Mycenaean empire of 
Greece, the ancient power that, according 
to myth, battled Troy. Egyptian hieroglyph- 
ics depict a sea battle with people from the 
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Infant burials beneath Philistine houses 
in Ashkelon, Israel, yielded ancient DNA. 


north whom 19th century scholars called 
the “Sea Peoples.” But other scholars think 
Philistine culture spread when ancient em- 
pires in Turkey and Syria declined and local 
people filled the void. 

Master invited Feldman’s adviser, paleo- 
geneticist Johannes Krause of the Max 
Planck Institute for the Science of Human 
History in Jena, Germany, to try to extract 
DNA from the teeth and inner ear bones of 
skeletons excavated in Ashkelon. The team 
analyzed 1.24 million sites across the ge- 
nomes of 10 skeletons. Three of the oldest 
individuals, who lived 3500 to 3700 years 
ago, were not distinguishable genetically 
from local Levantine 
people. But DNA from 
four infants buried be- 
neath the earthen floors 
of homes in Ashkelon 
500 years later, when 
Philistine culture first 
appears, told a different 
story. They had inher- 
ited 25% to 70% of their 
DNA from southern Eu- 
ropean ancestors, and 
the closest matches were 
to ancient people from 
the Aegean, Sardinia, 
and Iberia. The remain- 
ing DNA was from local 
people, suggesting their 
European ancestors had 
quickly mated with their new neighbors. In- 
deed, two styles of pottery in neighboring 
houses suggest that Philistines and Levan- 
tines lived side-by-side in Ashkelon. 

Just 200 year later, however, the DNA of 
three adults, presumably Philistines, fully 
matched that of local Levantine people. 
Intermarriage had swamped the genetic 
heritage of the European immigrants, 
Krause suggests. 

With the study “we finally have real scien- 
tific proof that people moved into Ashkelon 
from Europe,” says Kristian Kristiansen, an 
archaeologist at the University of Gothen- 
burg in Sweden, who suspects they hailed 
from Italy. But it will take ancient DNA 
from across southern Europe to pinpoint 
their homeland. & 
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The ethanol in Rob McGinnis’s 
gloved hands was synthesized 
in the device behind him. 
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energy to turn carbon dioxide and water into gasoline 


UEST FOR FIRE 


Rob McGinnis aims to use renewable 


By Robert F. Service, in San Francisco, California; Photography by LiPo Ching 


nawarm March day here, you could 
almost mistake Rob McGinnis 
for a huckster newly arrived in 
a frontier town as he delivers a 
rapid-fire pitch to an audience of 
thousands of would-be investors. 
McGinnis, a chemical engineer and 
entrepreneur, isn’t hawking snake 
oil, however: His elixir is gasoline. 
Nearly everyone in the developed world is 
hopelessly addicted to it. Collectively, we use 
nearly 3 trillion liters every year. 

At the pitch fair, McGinnis wears the Sili- 
con Valley entrepreneur uniform of jeans, a 
black T-shirt, and black leather biker boots. 
On a theater-size stage, he delivers his spiel, 
sandwiched between 3-minute presentations 
for an online personalized clothing store 
and an outfit that would rent scooters by the 
month. “We make gasoline from air, water, 
and electricity,’ McGinnis announces. “Today, 
gasoline sells for $3.50 a gallon in California. 
Next year, we will be selling it for $3 per gal- 
lon.” Other startups peddling ideas at the fair 
foresee markets in the billions, but McGinnis 
aims higher. “We’re talking about a $2 trillion 
[per year] gasoline market,” he says. 

If all that sounds too good to be true, it 
might be. “I hope they’re right,” says Olgica 
Bakajin, CEO of Porifera Inc., a San Leandro, 
California, company that has also worked on 
systems like those at the heart of McGinnis’s 
fuelmaker. But she notes that McGinnis “is a 
good talker who sells things well.” 

He has convinced some powerful investors. 
In December 2018, he received $150,000 from 
Y Combinator, the Mountain View, California, 
seed funder hosting the pitch fair, to build a 
prototype of his air-to-gasoline-maker. The 
result was a refrigerator-size contraption of 
catalysts, tubes, electronics, and filters, as- 
sembled a week before the pitch fest. 

But before the demo, the machine sprang a 
leak. Although it wasn’t operating at the pitch 
fest, McGinnis’s optimism was. He promised 
audience members that the repaired device 
would extract carbon dioxide (CO,) from the 
air, add it to water, and use a catalyst to re- 
arrange the chemical bonds to make hydro- 
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carbons. The result: fossil fuel without the 
fossils. “It can sound like magic, but it’s really 
just chemistry,’ McGinnis told the audience. 


SYNTHESIZING GASOLINE, instead of refining 
it from oil, isn’t a new idea. German chemists 
in the 1920s discovered they could turn coal 
into carbon monoxide (CO) and hydrogen—a 
combination known as synthesis gas. Cata- 
lysts, along with heat and pressure, could 
then transform synthesis gas into gasoline 
and other liquid hydrocarbons. 

But McGinnis’s setup requires no heat, 
pressure, or coal. It uses only air, water, and 
electricity, which can come from the sun or 
wind. And with those renewable resources 
becoming ever cheaper, he’s betting he can 
deliver gasoline more economically—and 
far more cleanly—than companies that must 
find oil, drill for it, ship it, and refine it. 

Several other startups and academic labs 
are pursuing the same dream. “There has 
been a lot of progress in the last few years” in 
turning CO, into more-complex compounds, 
says Peidong Yang, a pioneer in the field at 
the University of California, Berkeley. 

Yet many of those efforts have stumbled 
over the expensive, energy-intensive steps 
needed to separate the hydrocarbons from 
the water they are produced in. Prometheus 
relies instead on a proprietary carbon nano- 
tube membrane sieve that it says readily 
parts the hydrocarbons from water. “If they 
indeed have a low-energy separation process, 
that solves a big problem,” Yang says. 

“Rob’s approach has a good chance of 
competing with fossil fuels,’ says Matthew 
Eisaman, a physicist at the State University 
of New York in Stony Brook who consults for 
Prometheus. Eisaman ran a now-mothballed 
research program at Google’s research arm, 
Google X, that aimed to turn the CO, in sea- 
water into liquid fuels. 

Bruce Hinds, a nanotube membrane ex- 
pert at the University of Washington in Se- 
attle, says McGinnis’s published results on 
his separation technology inspire confidence 
that the approach could work. “I’m highly en- 
couraged,” Hinds says. 
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McGinnis has long defied expectations. Af- 
ter high school, he enlisted in the Navy, which 
sent him to Bahrain during the first Gulf War. 
There, he cleared mines from battlefields and 
harbors. “I didn’t want to do it,” he says of 
his time in the service, but he needed money 
for school. He enrolled in Cabrillo College, a 
public community college near Santa Cruz, 
California, where he dreamed up an energy- 
efficient approach to desalinating water 
during a chemistry class. Today, most water 
desalination uses reverse osmosis, which em- 
ploys energy to push water through a mem- 
brane that excludes salts. McGinnis planned 
instead to use forward osmosis, which relies 
on differences in the concentration of com- 
pounds on either side of a membrane to move 
water across; it consumes only about half as 
much energy. “I prototyped it in my kitchen,” 
he says. “I wanted something to distinguish 
myself to [transfer] to a really good school.” 

His school plan worked, and McGinnis 
moved to Yale University for his junior year. 
He majored in theater and focused on play- 
writing. “It was really good training,” he says. 
“A lot of what entrepreneurs do is tell stories.” 

But he never gave up on science. 
McGinnis canvassed the chemistry faculty 
to see whether anyone would let him use lab 
space in the evenings to pursue desalination. 
Menachem Elimelech, a Yale chemical engi- 
neer, agreed. McGinnis later pursued a Ph.D. 
in Elimelech’s lab, constructed a forward 
osmosis demonstration, and helped launch 
a startup company called Oasys Water in 
Cambridge, Massachusetts, to commercialize 
the technology. Oasys built five large water 
treatment plants in China and was eventually 
bought out by its customer there. 

Even before the buyout, McGinnis felt side- 
lined from decision-making. He left Oasys in 
2012 to explore other ideas for improving 
membranes for separations, initially looking 
for even better ways to purify water. He chose 
to pursue membranes made from thin plas- 
tic sheets shot through with myriad carbon 
nanotubes—tiny hollow tubes made entirely 
from carbon atoms. Researchers at Lawrence 
Livermore National Laboratory in Califor- 
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Making fuel out of thin air 

At the heart of a new fuelmaking machine are pipes shot through with tiny carbon mesh 
tubes that filter fuel from water using little energy. The fuel comes from an electrochemical 
process that combines water (H,0) with carbon dioxide (CO,) from the air. 


Graphic by Chris Bickel 


Combustion in reverse 
Electricity and a copper surface convert CO, and H,0 into 
ethanol, an alcohol that can link into longer fuel molecules. 


1 CO, reacts 
with H;0 to 
form carbonic 
acid (H,CO;). 


Copper surface e 


2 Electricity 
converts carbonic 
acid to carbon 
monoxide (CO) 
and hydroxide 
(OH-) on copper 
surface. 


3 CO and H,0 
react at copper 
surface to 
make ethanol. 


4 Zeolite catalyst 


converts multiple cd an 
alcohols Zeolite 
to larger fuel catalyst 
molecules. 
Fuel 
The ultimate filter 


Cheap plastic sheets house trillions of carbon nanotubes 
per square meter, which zip alcohols through (orange) 
while blocking H,0 (blue). 


Outside 


Inside 


Carbon nanotubes 
For filtering fuel, the best hollow carbon tubes 
are about 1 nanometer (nm) in diameter. 


Atomic pattern 
Connected rings of carbon 
atoms roll up into hollow tubes. 
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Filter 


A prototype fuelmaker 
Prometheus's machine houses the entire 
process, from CO, capture to ethanol 
synthesis. If the device is powered with 
renewable electricity, using the fuel releases 
zero net greenhouse gases. 


Separating fuel from water 
A mix of H,0 and fuel enters the filter. 
Fuel travels through the nanotubes 
into the core of the fibers, where a 
vacuum draws it out. 


ae 


Flowin 


1.5 millimeters 


Filter fibers work together 
Long, thin filters are bundled 
to scale up the filter. 
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nia and elsewhere had shown that the tiny 
channels allow water to pass through while 
blocking other molecules. But the lab dem- 
onstrations employed dime-size membranes; 
larger membranes leaked and _ weren’t 
uniform. “Carbon nanotube membranes 
never lived up to the hype,’ says Jeffrey 
McCutcheon, a membrane separation expert 
at the University of Connecticut (UConn) in 
Storrs who collaborates with McGinnis. 

McGinnis thought he could solve the prob- 
lem. He formed a company called Matter- 
Shift and received lab space in a startup in- 
cubator at UConn. After 5 years, he could 
make uniform membranes the size of a 
sheet of paper, consisting of carbon nano- 
tubes embedded in a cheap commodity plas- 
tic called polyethersulfone. 

The key, McGinnis says, was figuring out 
how to align vast numbers of nanotubes— 
roughly 2.5 trillion per square meter—so that 
most pierce the membrane perpendicularly, 
as ion channels on a cell surface do. Some 
researchers have speculated that magnetic 
fields are key to the process, but McGinnis 
isn’t saying. “It’s our secret sauce,” he says. 

In March 2018, he, McCutcheon, and col- 
leagues reported one possible use for their 
membranes in Science Advances: filtering out 
organic contaminants, such as odor-causing 
compounds, from water, while applying only 
a fraction of the pressure used to push wa- 
ter through reverse osmosis membranes. But 
McGinnis sees that use as a demonstration 
rather than his primary commercial target. 

In the past year, he and his team have come 
up with a way to transform their membranes 
from flat sheets into narrow hollow plastic 
fibers dotted with nanotube pores. The re- 
searchers can manufacture those fibers in a 
continuous process, McGinnis says, cutting 
them to any length and bundling them to 
make industrial filters. 


THE NANOTUBE FILTERS can perform a far 
more important feat than removing contami- 
nants, McGinnis says: They separate ethanol 
from water. Carbon nanotubes of the right 
diameter—about 1 nanometer—transport 
ethanol more quickly than water through 
their interior. McGinnis explains that etha- 
nol’s carbons have an affinity for the inside of 
the carbon nanotubes. So if the starting liq- 
uid contains at least 5% to 10% ethanol and a 
slight vacuum draws it through the filter, the 
alcohol molecules form a molecular conga 
line through the nanotubes, excluding nearly 
all water. The filtered solution winds up con- 
taining about 95% ethanol. 

McGinnis and his team haven’t published 
those separation results yet. But researchers 
led by Yang Decai from Dalian University 
of Technology (DUT) in China reported in 
August 2018 in Nano Letters that a similar 
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carbon nanotube membrane was highly se- 
lective and fast at separating ethanol and bu- 
tanol (another alcohol) from water. 

If commercialized, such membranes could 
benefit biofuel companies that make ethanol 
from corn, McGinnis and others say. Fer- 
mentation leaves a solution of 10% ethanol 
in water. Today, ethanol producers use heat 
and 6-meter-tall distillation columns to boil 
off the ethanol, an energy-intensive process 
that costs about a third as much as the al- 
cohol itself. McGinnis says his membranes 
could cut the distillation cost by 90% in an 
ethanol market worth $50 billion per year in 
the United States. 

If the membranes work as claimed, “That 
by itself would be big,” Peidong Yang says. 

McGinnis is working to prove that they do. 
Last year, MatterShift and partners at UConn 
received a $900,000 grant from the Depart- 
ment of Energy to demonstrate their etha- 
nol separation technology. Their test uses a 
2-meter-high tube with 1400 nanotube-pocked 
fibers, with results expected this summer. 

If either MatterShift’s or DUT’s membranes 
prove durable and long-lived, bioethanol pro- 
ducers should represent an eager market, 
says David Sholl, a chemical engineer at the 
Georgia Institute of Technology in Atlanta. 
McGinnis has already founded a company 
called MatterShift Biofuels to commercial- 
ize the technology. But he envisions a bigger 
future for his membranes: not just filtering 
ethanol fermented from corn or sugar, but 
also purifying fuel made from the air itself. 

Synthesizing the fuel is the easy part. 
Peidong Yang’s team and groups at Oak 
Ridge National Laboratory (ORNL) in Ten- 
nessee and the University of Illinois in Ur- 
bana have published papers in the past 
3 years showing that electricity and nanosize 
copper catalysts can turn CO, and water into 
a mix of alcohols. And startups including a 
New Orleans, Louisiana, company called 
ReactWell are pursuing related approaches. 

Thus far, the ORNL team has reported the 
highest efficiency, turning 23% of the electri- 
cal energy into fuel. But all the groups using 
the approach to make alcohols face the chal- 
lenge of separating the fuel from the water. 
McGinnis says his membranes are the an- 
swer. They are “the new piece in the puzzle 
no one else has.” 

In the air-to-fuel machine he hoped to 
demonstrate at Y Combinator, the mem- 
branes filter a liquid that flows from a 
meter-wide chamber containing two elec- 
trodes dunked in water. When air blows 
through the chamber, the CO, it contains 
reacts with water, producing carbonic acid— 
the same molecule acidifying the oceans. 
That acid, in turn, reacts on a copper catalyst 
coating the negative electrode, or cathode, to 
create CO. The cathode also strips protons off 
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water molecules, leaving behind negatively 
charged hydroxide ions. Those ions travel 
to a positively charged electrode, or anode, 
where they react to form water and oxygen 
gas. Meanwhile, at the cathode, multiple CO 
molecules and protons are transformed into 
ethanol and other alcohols. 

The result is the alcohol and water mixture 
that goes through the nanotube fibers. Pro- 
metheus has repaired its machine since the 
pitch fair, and it produces “a pretty steady 
drip” of fuel, McGinnis says: 10 milliliters per 
hour of alcohol that trickles out a red valve 
in the back. Over the next month, McGinnis 
and his colleagues plan to increase the size of 
their electrodes and catalysts to raise the pro- 
duction rate to 50 to 100 milliliters per hour. 

Ultimately, McGinnis plans to add a sec- 
ond catalytic step using commercially avail- 
able catalysts called zeolites, which would 
convert the mix of alcohols to the larger 
hydrocarbon molecules found in gasoline. 
“All of the pieces of this process have been 
proved to work. But no one has put them all 
together,” he says. “Until now.” He expects the 
device, when optimized, to produce 20 liters 
of gasoline per week. 

Once the machine is working efficiently, 
electricity will make up about one-third of its 
operating costs. Renewable electricity prices 
around the globe are falling, however, and 
they already sink near zero at certain times 
of the day in places where the sun blazes or 
the wind howls. Prometheus, McGinnis says, 
could easily ramp its electricity demands 
up and down to take advantage of the low- 
est rates, and the machines could be sited 
wherever renewable power is cheapest. Next 
year, the company plans to build a $500,000 
shipping container-size demonstration plant 
that can produce hundreds of thousands of li- 
ters of fuel per year. And last month, it inked 
its first deal, to begin to sell carbon-neutral 
fuel to Boom Supersonic, a Denver company 
building a supersonic commercial airliner. 

Even if all goes according to plan, McGinnis 
will face a long road to compete with the 
likes of ExxonMobil. He’ll have to prove he 
can build a fuelmaker cheaply enough to 
make its gasoline affordable. That could be 
tough if turning it on makes sense only when 
renewable electricity prices bottom out. The 
fuelmaker also works only with a source of 
clean water. And before he can market his in- 
vention, he’ll need to prove that his fuels can 
directly substitute for fossil-derived versions. 

At the pitch fair, McGinnis stands next 
to his prototype and repeats his story for a 
steady stream of potential investors. “We 
want to replace all fossil gasoline,” he says. 
“That would make the world a better place.” 
A few hours later, he loads up his wares and 
moves on in the hope that one day his dreams 
turn into an energy revolution. 


5 JULY 2019 * VOL 365 ISSUE 6448 21 


= 


LETTERS 


NEXTGEN VOICES 


Pithy burnout prevention 


We asked young scientists this question: In exactly six words, what is the 
secret to preventing burnout? In their succinct responses, scientists from 
around the world suggested ways to prioritize breaks, focus on the positive, and 
find inspiration in work, hobbies, and community. Our respondents urged re- 
searchers facing burnout to take care of themselves, whether that means setting 
boundaries or giving themselves permission to splurge. Follow NextGen on 
Twitter with hashtag #NextGenSci. See all NextGen Voices results at 


https://science.sciencemag.org/collection/nextgen-voices. 


Invest in hobbies 


Improve a skill unrelated 
to research. 
Ming-Ju Amy Lyu, China 


Walk while listening 
to soothing songs. 
Jian Zhang, China 
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Incorporate parkour 
tricks into coffee runs. 
Raf Aerts, Belgium 


Channel exhaustion 
to creativity; 

art works! 

Shruti Sharma, USA 


—Jennifer Sills 


Sing loudly! 

It’s socially 
acceptable screaming. 
Tiffany Phu, USA 


Rescue a dog, 
catalyst for joy. 
Matt Joseph Kuhn, USA 
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Regenerate neurons 
by exercising. Start yesterday. 
Athanasia Nikolaou, Germany 


Do something that 
brings joy daily. 
Carson Alice Wills, USA 


Beer tastings count 
as field research. 
Beth McKinnon Adamowicz, USA 


Watch kitten videos 
during conference calls. 


Aric S. Campling, USA 


Cultivate community 


Chilling with friends 
makes me fireproof. 
Vanessa Karina 

Alves da Silva, UK 


Work with people who 
respect you. 
Marion White, USA 


Tell your supervisor about 


your feelings. 
Clara del Junco, USA 
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Intentionally cultivate honest, 
supportive work friendships. 
Rosalie Shinwei Doerksen, USA 


Margaritas with temporary 
friends at conferences! 
Heather Burkin, USA 


Recognize other people's 
confidence in you. 
Jonathan Marquez, USA 


Coffee with lab friends is 
indispensable. 
Joel Henrique Ellwanger, Brazil 


Team sports and friends 
outside academia. 
Andrés S. Rigual Hernandez, Spain 


Cook elaborate meals 
for your friends. 
Leah Cairns, USA 


Take your time and get support. 


Ingrid Zuniga, Mexico 


Read bedtime stories 
to the kids. 
Ben de Haas, Germany 
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Set intentional goals 


Find permanent positions 
with permanent funding! 
Sacha Escamez, Sweden 


Find work that fits your 
personality. 
Robert P. Tett, USA 


Share long-term ambitions 
with collaborators. 
Yuta Igarashi, Japan 


Define your target. 
Focus on it. 
Allison Matia, USA 


Set relevant, achievable, 
and short-term goals. 


Marcela Viviana Nicola, Argentina 


Divide big projects into 
smaller tasks. 


Lin Wang, France 


Celebrate success 


A glass of beer after 
brainstorming. 
Luiz H. Varzinezak, Brazil 


Dance, listen to music 
when stressed. 


Karishma Shaik, /ndia 


Impromptu, solo dance 
parties to Toto. 
Déna Jansen, South Africa 


Recurring break to garden 
and hike. 
Aminata Coulibaly, USA 


Learn to enjoy the small 
successes. 
Satyanarayana 
Subrahmanyam, India 


Remember the wins 
and stay grateful. 
Caitlin M. Aamodt, USA 


Acknowledge and celebrate 
team's progress weekly! 
Salma Kaochar, USA 


Delight in the success 
of colleagues. 
Catherine Carbone, USA 


Find meaningful reward 
in the journey. 
Carol Lee Wilkinson, USA 


Maintain perspective 


Try your best. Let it be. 
Junxiang Cheng, China 


Remember! 
You are not your job. 
Rachel Hale, New Zealand 


Keep the big picture in mind. 
Sarah Ch’ng, Australia 


Recall why and how you 
started. 
Jing Woei Li, Hong Kong 


Have something more 
important in life. 

Amanda Linnea Karolina Jonsson, 
USA 


Love yourself more than 
your work. 
Mallory Rose Peterson, USA 


Recognize burnout’s 
systemic, not personal failure. 
Rachel Yoho, USA 


See the inner child in everyone. 
Joseph Michael Cusimano, USA 


Focus on the joy of discovery. 
Richard Gaughan, USA 


Bonfire: Reviewer 2's com- 
ments as kindling. 


Colin Murphy, USA 


Prioritize self-care 


Mindful breathing, 15 minutes 
every day. 
Oriana Jovanovi¢, Hong Kong 


Sleep! Even Einstein got 10 
hours. 
Josiah Dykstra, USA 


Appropriately titer your 
caffeine:ethanol ratio. 
Kelvin Yen, USA 


Invest in yourself and 
experiments equally. 
Victoria Tokarz, Canada 


Know limits; take care 
of yourself. 
Ruwansha Galagedara, Sri Lanka 


Resting creates space for 
great ideas. 
Ingrid Olivares, UK 


Internet-free day keeps 
burnout away! 
Martin Schwarzer, Czech Republic 


Regular breaks, including 
a postlunch nap. 
Rambabu Korrapati, /ndia 


Talk with a mental health 
professional. 
Carl Davidson Gandola, USA 


Set boundaries 

Delegate all tedious work to 
minions. 

Thomas Alejandro Berrueta, USA 
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Work actual contract hours, 
not weekends. 
Klaudija Daugelaite, France 


Honor and respect work-life 
balance. 

Shahienaz Hampton, United Arab 
Emirates 


Remember, “breakthrough” 
starts with a “break.” 
Anne-Sophie Hafner, Germany 


Thou shall not do it all! 
Mac Kevin Ella Braza, Philippines 


Realize long hours don't equal 
productivity. 
Edward Lau, USA 


Do not take on unnecessary 
responsibilities. 
Evrim Fer, Turkey 


The secret: Learn to say “No.” 
Maria Romina Schiaffino, Argentina 


Refuse to do what doesn’t 
matter. 
Shervin Fatehi, USA 


Splurge 
Indulge in guilty pleasures 


without remorse. 
Yong Tang, USA 


Give yourself permission 
to be selfish. 
Howard M. Huynh, Canada 


Ignore serving size 
suggestions on snacks. 
Do Soon Kim, USA 


Eat cake...Didn't work? 
Eat more! 
Michael Patrick Schwoerer, USA 


Sugar is brain food. Keep 
snacking. 
Emily Royse, USA 


Coffee before lab. Goodbye, 
steady hands! 
Mark Martin Jensen, USA 


Ditch that coffee and just 
snooze. 
Charles Teta, South Africa 


Clean house. Just joking. 
Netflix binge. 
Ken Dutton-Regester, Australia 


Conference in Hawaii. 
Don't attend talks. 
Soham Saha, USA 


Deep breathing 
(if inhaling food counts) 


Dhruv Patel, USA 


10.1126/science.aay2936 
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Restored and regenerated forest 

- inthe Reserva Ecoldgica de Guapiacu, 
Brazil illustrates how forests can 

~= regain maximum potential canopy cover. 


Restoring forests as a means to many ends 


An urgent need to replenish tree canopy cover calls for holistic approaches 


By Robin Chazdon and Pedro Brancalion 


arth is approaching environmental 
thresholds that, if crossed, will cre- 
ate serious disruptions to ecosystems, 
economies, and society (1). To avoid 
the devasting effects of climate change 
and biodiversity loss, humanity must 
protect and restore native ecosystems (2). 
International conventions and organizations 
support forest restoration as a method for 
mitigating hazardous environmental shifts, 
but questions remain as to where and how to 
focus such restoration efforts. On page 76 of 
this issue, Bastin et al. (3) de- 
scribe a new approach that 
advances our understanding 
of global tree distribution. 
Prevailing views on how 
to assess forest cover and 
restoration opportunities 
might actually hinder rather 
than promote the large-scale 
action needed. Traditional 
methods for evaluating 
changes in forest cover are 
based on a dualistic (forest/ 
nonforest) classification of 


Restoration 
outcomes 


: * Scaling up 
land use that applies a sub- * Accountability 
jective threshold to the 0 to —.. Innovation 


100% canopy-cover gradient 
(4). Bastin et al. move beyond 
this strict dichotomy with 
the use of direct photo-in- 
terpretation measurements 


that account for wooded Capital stocks 


: . + Natural 
lands that lie outside forest. yiman 
areas. The authors identify — . \anufactured 
the global potential for en- * Social 
hanced tree canopy cover * Financial 


on the basis of variations in 
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natural levels of tree cover, which has imme- 
diate implications for restoring forest cover. 
Test scenarios that assess how future climate 
change will affect global canopy cover em- 
phasize the urgency of taking action now, 
as shifting responses of forests to climate 
change are already under way and will be- 
come far more pronounced. 

This urgency of restoring tree cover does 
not justify short-term fixes that fail to pro- 
duce environmentally and socially benefi- 
cial long-term outcomes. An assessment of 
the biophysical capacity for restoring global 
tree cover provides a necessary but insuf- 


Restoration systems deliver multiple benefits 

A forest restoration system integrates structures (trunk and roots) that assimilate and 
distribute resources (soil) to initiate and sustain restoration outcomes (branches). The system 
ultimately delivers myriad benefits for nature and humanity (fruits). Restoration outcomes 
and benefits sustain the system, stabilize enabling conditions, an 


Outputs 


d increase capital stocks. 


ficient foundation for evaluating where tree 
cover can be feasibly increased. The kinds 
of trees as well as how and where they are 
grown determine how and which people ben- 
efit. In some contexts, increasing tree cover 
can elevate fire risk, decrease water supplies, 
and cause crop damage by wildlife. Refores- 
tation programs often favor single-species 
tree plantations over restoring native for- 
est ecosystems. This approach can generate 
negative consequences for biodiversity and 
carbon storage (5), threaten food and land se- 
curity, and exacerbate social inequities. How 
restored lands are governed determines how 
reforestation costs and ben- 
efits are distributed. 
Without attention to 
these social and environ- 
mental issues, efforts to rap- 
idly increase tree cover are 
likely to prove ineffective 
in the long term. Initiated 
by Germany and the Inter- 


* Social mobilization 
* Capacity building 
* Cost-effectiveness 


Restoration 
initiatives 


Inputs 
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r Restoration benefits 

+ Climate mitigation 
and adaptation 

+ Biodiversity 
conservation 

* Ecosystem services 

+ Local livelihoods 

* Economic gains 

* Food security 

L. + Well-being 


Enabling conditions 

+ Effective governance 

+ Institutions 

* Supply chains 

¢ Multisector alignment 

+ Public-private 
partnerships 

+ Stakeholder 

L engagement 


national Union for Conser- 
vation of Nature in 2011, 
the Bonn Challenge has 
garnered 58 commitments 
to restore 170 million hect- 
ares of forest land by 2030. 
The Global Deal for Nature 
(6) proposes that 30% of 
Earth’s surface be formally 
protected and an additional 
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20% designated as climate stabilization ar- 
eas by 2030, to stay within 1.5°C of global 
temperature rise. 

Reaching ambitious restoration targets 
appears unlikely, however, without practical 
guidelines and tools, institutional alignment, 
accountability mechanisms, and a robust evi- 
dence base. Enormous gaps remain between 
high-level focus on restoration and imple- 
mentation on the ground (7). Forest resto- 
ration programs are largely conducted on 
project scales by using approaches that are 
neither cost effective nor designed to achieve 
multiple benefits and long-term sustainable 
outcomes (8). In contrast to such site-based 
practices, forest and landscape restoration is 
a holistic approach; it aims to balance diverse 
types of tree cover to achieve multiple ben- 
efits, based on the local socio-ecological con- 
text and stakeholder engagement (9). This 
landscape approach is vital to reaching the 
global scales needed to reverse the effects of 
deforestation and land degradation. 

Forest restoration is a mechanism to 
achieve multiple goals, including climate mit- 
igation, biodiversity conservation, socioeco- 
nomic benefits, food security, and ecosystem 
services. A living tree helps to illustrate the 
components of a holistic forest restoration 
system (see the figure). The above-ground 
portions of the tree are the trunk (restoration 
initiatives), branches (restoration outcomes), 
and fruits (restoration benefits). Below the 
ground are soil (capital stocks) and root 
structures that assimilate and distribute re- 
sources and create enabling conditions. 

Viewing restoration as a system also 
guides appropriate mechanisms to mobilize 
resources in cost-effective ways. One such 
mechanism is to use the best available spatial 
information to identify areas where the most 
beneficial and feasible restoration outcomes 
intersect (10). Bastin et al. clearly show that 
we have a narrow window of time in which 
to restore global tree cover. We need to act 
quickly, intelligently, holistically, and globally. 
Doing so will require planting and sustaining 
of restoration systems wherever new types of 
canopy cover are needed. & 
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GENOME EDITING 


Inserting DNA with CRISPR 


Inserting large DNA segments with CRISPR holds great 
promise for genetic engineering 


By Zhonggang Hou and Yan Zhang 


ost prokaryotes rely on_ the 
CRISPR-Cas system for adaptive 
immunity against viruses and mo- 
bile elements (J, 2). Small RNAs 
produced from CRISPR direct Cas 
effector proteins to seek and de- 
stroy nucleic acids from invaders that have 
complementary target sites (3). There are 
multiple types of CRISPR, which are de- 
fined on the basis of their protein composi- 
tion. Recently, RNA-guided nucleases from 


pends on the repair of this DSB by the en- 
dogenous pathways in the host cell. Repair 
by end-joining DNA repair pathways often 
predominates and tends to introduce het- 
erogeneous small DNA insertions or dele- 
tions (indels) that disrupt a gene’s function. 
However, many applications require site- 
specific knock-in of large DNA segments, 
such as a therapeutic gene or reporter genes. 
This can be achieved through accurate DSB 
repair by the homology-directed repair 
(HDR) pathway, which requires long se- 
quence homology between a supplied DNA 


types II and V CRISPR systems, Cas9 and 
Cas12, have revolutionized genome editing 
by allowing programmed DNA sequence 
alterations (4). However, robust and tar- 
geted insertion of a large DNA segment into 
eukaryotic genomes has remained chal- 
lenging. On page 48 of this issue, Strecker 
et al. (5) show that a CRISPR-associated 
transposase (CAST) mediates highly effi- 
cient, RNA-guided insertion of cargo DNA 
into the bacterial Escherichia coli genome. 
Moreover, Klompe et al. (6) report another 
CRISPR-guided DNA transposase system 
that operates similarly. These studies offer 
new tools that could transform genetic en- 
gineering and gene therapy research. 

In a typical CRISPR application, Cas9 or 
Cas12 is directed by its guide RNA to the in- 
tended complementary genomic site flanked 
by a protospacer-adjacent motif (PAM), and 
creates a DNA double-strand break (DSB) 
(4). The outcome of gene editing largely de- 
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template and the genomic regions flanking 
the insertion site. HDR is inefficient and 
often restricted to certain stages of the cell 
cycle or by cell type (7, 8). Therefore, there is 
an urgent need for tools that enable robust 
and targeted DNA integration. 

DNA transposons, also known as “jump- 
ing genes,’ can move from one genomic 
location to another, and their relocations 
and insertions are catalyzed by an enzyme 
complex called transposase. Since 2017, 
the CAST systems have been discovered 
through bioinformatics mining of new 
CRISPR variants (9, 10). CAST comprises 
a miniature type I or V CRISPR-Cas en- 
coded within a Tn7-like transposon. The 
mini-CRISPR systems encode protein ma- 
chineries that either lack the Cas3 nuclease- 
helicase of the type I CRISPR system or 
carry inactivating mutations in the catalytic 
residues of Cas12k of the type V system. 
Accordingly, they could be competent for 
target sequence binding but not for DNA 
cleavage. It is speculated that these defec- 
tive CRISPRs have been hijacked by Tn7- 
like transposons to serve a role other than 
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prokaryotic adaptive defense. Instead, they 
might increase the evolutionary success of 
transposons by enabling their RNA-guided 
spread across bacterial genomes and other 
mobile DNA elements (9, 10). The putative 
RNA-programmable nature of CAST con- 
trasts with the poor target site selectivity of 
canonical transposases, which insert their 
cargo DNA semirandomly. 

Strecker et al. established the function- 
ality of two CAST loci from cyanobacteria 
Scytonema hofmanni (Sh) and Anabaena 
cylindrica (Ac) in vivo and elucidated the 
underlying mechanism. Each CAST locus is 
~20 kilobases, comprising Tn7-like trans- 
posase genes (tnsB, tnsC, tniQ), additional 
cargo genes, and genes encoding a type V-K 
CRISPR and its effector Cas12k as well as 


DNA of up to 10 kilobases was integrated 
efficiently into the target plasmid. 

The PAM and target sequence for 
CRISPR-Cas12k remain intact after trans- 
position, yet they cannot support additional 
rounds of effective CAST integration into 
the same downstream region that already 
received a copy of the transposon. This ob- 
servation resembles the “target immunity” 
phenomenon where bacterial T7 transpo- 
son avoids repeated insertion into the same 
site (11). Notably, the 5-base pair sequence 
before each integration site is duplicated 
to flank the other end of the inserted DNA. 
This is consistent with staggered single- 
stranded DNA gaps generated by T7 trans- 
posons (12), which are presumably sealed 
by gap-filling host factors. Taken together, 


CRISPR-guided DNA insertion 
CRISPR-associated transposase (CAST)—mediated 
DNA insertion is guided by CRISPR-Cas12k. The 
inserted transposon can include cargo DNA for 
genetic engineering. 


5-bp 
insertion site 


5 GTN NNNNN NNNNN 3 
3 CAN NNNNN NNNNN 5 


\_____60-66 bp —— 


trans-activating crRNA (tracrRNA), which 
is an invariant small RNA cofactor that is 
essential for Cas12k. The authors assayed 
potential transposition into plasmid DNA 
in the heterologous host FE. coli. A helper 
plasmid encoding all CAST protein and 
RNA components was delivered into E. 
coli, together with a donor plasmid and a 
target plasmid (see the figure). Targeted 
donor integration occurred in one specific 
orientation, with an efficiency of 13 to 75%. 
Integration was guided by CRISPR-Cas12k, 
because it requires tracrRNA, a cognate 
target, and a 5’ flanking GTN PAM. Most 
notably, the insertion sites were clustered 
unidirectionally within a narrow window 
of DNA sequence, 60 to 66 base pairs for 
ShCAST and 49 to 56 base pairs for AcCAST, 
downstream from the PAM. Donor cargo 
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tnsB-tnsC-tniQ —e 
Any base pair 


Duplicated 
5 ) insertion site 


1 CAST system is 
delivered to cells. 


2 CAST locates the 
target DNA sequence 
in host cells via 
guide RNA-target 
complementarity and 
a flanking PAM 
recognized by Cas12k. 


Transposon 
Left Cargo Right 
d 


3 CAST components 
mediate robust cargo 
DNA insertion, which 
does not require 
DNA double-strand 
break repair by host 
cell pathways. 


4 Integration in the 
host genome occurs 
60 to 66 base pairs 
(bp) downstream 
of PAM. 


5 Transposon ends and 
' the intervening 
custom DNA cargo 
are inserted in one 
orientation. 


CAST is a bona fide CRISPR-guided, pro- 
karyotic transposon. 

Strecker et al. reconstituted CAST-cata- 
lyzed DNA insertion in vitro and also dem- 
onstrated that CAST could be repurposed 
as a targeted DNA insertion tool for bac- 
terial genome engineering. ShCAST was 
reprogrammed against 48 different target 
sites in noncoding regions of the E. coli ge- 
nome and achieved targeted insertion at 29 
loci. For a few sites, the efficiencies are im- 
pressively high (50 to 80%), obviating the 
need for positive selection to detect trans- 
position. Insertions were also profiled on a 
genome-wide scale using unbiased deep se- 
quencing. Approximately half of the total 
insertions were on-target, whereas the off- 
target insertions were scattered along the 
chromosome. Many of the top off-target 
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sites occurred repeatedly between FE. coli 
samples for distinct guide RNAs and thus 
are likely caused by a CRISPR-independent 
mechanism. Limiting the amount of trans- 
posase produced in the cells may help re- 
duce off-target effects. 

Klompe ez al. characterize a transposon 
from bacteria Vibrio cholerae that employs a 
mini-type I CRISPR-Cas to insert cargo DNA 
in two possible orientations. The integration 
is CRISPR-guided and robust, and displays 
low (less than 10%) off-target activity. 

Can we leverage the CRISPR-guided DNA 
transposase systems reported by Strecker 
et al. and Klompe et al. for highly efficient, 
targeted genome integration in eukary- 
otes? Compared to the current viral- or 
transposon-based therapeutic gene delivery 
into human cells (13), CRISPR-guided DNA 
transposase systems could be used to in- 
sert custom genes into the desired site and 
therefore avoid oncogenic risks associated 
with random genomic integration. Unlike 
Cas9- or Cas12-based editing tools, CRISPR- 
guided DNA transposase systems do not 
require DSB repair by the host HDR path- 
ways, and therefore may enable highly ef- 
ficient gene knock-ins in a wider variety of 
cell types and tissues. However, a limitation 
is that the terminal ends of the transposon 
will be incorporated together with the inter- 
vening cargo DNA, making applications that 
require “scarless” insertion impossible. The 
discovery of the CRISPR-guided DNA trans- 
posase systems again illustrates the power of 
bioinformatic mining of microbial genomes 
in uncovering new CRISPR variants with 
functions beyond defense (9, 70). As more 
CRISPR-linked accessory genes are discov- 
ered, the noncanonical facets of CRISPR bi- 
ology will be further revealed (14, 15). 
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Seaweed, seaweed everywhere 


Floating Sargassum seaweed species are spreading 


in the Atlantic Ocean 


By James Gower' and Stephanie King? 


hanks to the gas vesicles that give the 
Sargassum genus its name, the open- 
ocean seaweed species S. natans and 
S. fluitans float freely in the ocean. On 
page 83 of this issue, Wang et al. (1) 
report that patches and lines of these 
seaweeds have grown and spread through 
the Caribbean and across the north 
equatorial Atlantic to the west coast 
of Africa and the Gulf of Guinea, 
forming what the authors call the 
great Atlantic Sargassum belt. Other 
species of Sargassum, such as S. mu- 
ticum (2) and S. horneri (3), have also 
increased their range recently but are 
rooted and are probably being spread 
through shellfish shipments and ship 
ballast water. It is the freely floating 
species that are providing the major 
changes noted by Wang e¢ al. 

Greek and Roman classical au- 
thors reported strange seaweeds 
in the Atlantic beyond the Strait of 
Gibraltar, but Columbus was the 
first to give definite reports of Sar- 
gassum, during his voyage to the Americas 
(4-6). Later, Jules Verne, in his novel 20,000 
Leagues Under the Sea, gave Captain Nemo a 
submarine for exploring the world’s oceans, 
partly to enable him to cross the Sargasso 
Sea, the area of the North Atlantic gyre where 
Sargassum tends to accumulate. Sargassum 
does not pose a problem to modern shipping, 
but dense mats would have affected sailing 
vessels when wind speeds were low; further- 
more, it would have suggested dangerously 
shallow waters to sailors who lacked charts, 
were unsure of their position, and were used 
to seeing seaweed only near shore. 

S. natans and S. fluitans are highly vis- 
ible to optical sensors on modern satellites, 
thanks to the high infrared reflectance of the 
chlorophyll a pigment that is present in all 
photosynthesizing plants. For plants float- 
ing on the ocean, this gives high contrast, 
seen against the low infrared reflectance of 
seawater. Furthermore, this pigment has low 
reflectance at visible wavelengths and high 
reflectance in the infrared, separated by a 
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sharp increase (a red edge) at a wavelength 
of ~700 nm. This allows imaging instruments 
with suitable spectral bands to discriminate 
between floating plant life and clouds, haze, 
foam, or reflected sunglint, which lack this 
red edge. However, patches of Sargassum are 
often much smaller than the spatial resolu- 
tion (300 to 1000 m) of ocean-monitoring 
satellite imagers. The satellites therefore 


Sargassum seaweeds have washed up on an ocean beach in Playa del 
Carmen, Mexico, in May 2019. 


probably miss many smaller patches but can 
detect larger accumulations well enough to 
map the major concentrations of Sargassum. 

When first observed by satellite, Sargas- 
sum in the Gulf of Mexico during the “high 
Sargassum year” of 2005 formed patches and 
sinuous lines across the northern parts of the 
Gulf (7). Studies by satellite of its movement 
in the years 2002 to 2010 (8) showed an an- 
nual pattern in which it grew in spring in 
the western Gulf of Mexico, moved east into 
the eastern Gulf and Loop Current, then into 
the Atlantic Gulf Stream and Sargasso Sea 
in the fall. This picture was consistent with 
the distributions reported from earlier ship 
observations (2-4, 9-11), although the Gulf 
of Mexico was not previously seen as an area 
with substantial Sargassum growth. Before 
2011, satellite data showed no sign of major 
quantities of Sargassum in the Caribbean or 
equatorial Atlantic. 

This picture changed dramatically in 
2011 (12), when large amounts of Sargas- 
sum were observed in the Caribbean and 
equatorial Atlantic. Since then, this great 
Atlantic Sargassum belt has continued to 
grow in density and areal extent, albeit 
with considerable interannual variation 
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and with an unexplained annual cycle, as 
shown by Wang et al. Where it has drifted 
ashore in large amounts, especially on Ca- 
ribbean beaches (see the photo), Sargassum 
can have a substantial negative impact on 
tourism. On the other hand, floating Sar- 
gassum at sea is an important habitat for 
many marine species, and the overall long- 
term impact on tourism is not clear. The 
recent change in distribution suggests an 
important, but so far undetermined, change 
in the marine environment. As Wang et al. 
suggest, increased nutrients from coastal 
upwelling and from the Amazon River may 
be a cause of this change. 

Sargassum species are easy to track from 
space. Other marine species of plants or ani- 
mals that have low spectral contrast, do not 
form extended patches, or live at 
depth below the water surface could 
also vary widely in spatial distribu- 
tion, without this being as evident. 
Also, floating marine garbage, espe- 
cially plastics, is known to accumu- 
late in similar patterns but has thus 
far proved impossible to see by satel- 
lite. This is largely explained by the 
lack of infrared contrast and of the 
red edge of chlorophyll a. 

Observers in the Caribbean, Brazil, 
and Africa have called the Sargas- 
sum influx since 2011 unprecedented 
(12). However, there is no easy way of 
determining past distributions, and 
it remains unclear whether similar 
changes in Sargassum distribution 
have occurred before, although DNA tech- 
niques may one day give an answer. In the fu- 
ture, the Sargassum distribution may shrink 
back to its traditional territory, centered on 
the Sargasso Sea, or it may spread into other 
oceans. In either case, present satellite sen- 
sors are well suited to watch. 
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Integrating stress responses and immunity 


Stress is required to assemble immune signaling complexes during infection 


By Philippe Pierre 


he integrated stress response (ISR) is a 
gene expression program that protects 
cells from amino acid deprivation, oxi- 
dative stress, or stress in the endoplas- 
mic reticulum (ER) that is caused by 
protein misfolding and aggregation (J). 
Phosphorylation of the eukaryotic initiation 
factor 2a (eIF2q) initiates the ISR, but it also 
has a role in innate immunity (2, 3). In par- 
ticular, eIF2a is important to harness the full 
capacity of different immune cells to produce 
inflammatory cytokines or type-I interferon 
in response to microbial cues. On page 47 of 
this issue, Abdel-Nour et al. (4) reveal that 
in response to bacterial infection, the ISR 
promotes expression of heat shock protein 
family B-member 8 (HSPB8), which controls 
the assembly of signaling hubs for inflamma- 
tory cytokine production. This involves the 
formation of amyloid fibrils, which are also 
associated with neuroinflammation and neu- 
rodegeneration. Thus, understanding these 
regulatory processes may reveal new targets 
for the prevention of inflammation. 
eIF2a is a member of the heterotrimeric 
eIF2 complex that is required for most forms 
of eukaryotic translation initiation. It medi- 


ates the binding of initiating methionine 
transfer RNA to the ribosome in a guano- 
sine triphosphate (GTP)-dependent man- 
ner. eIF2a phosphorylation on Ser*! creates a 
dominant negative form of the eIF2 initiation 
factor that inhibits translation. In response 
to different stressors, eIF2a phosphoryla- 
tion is mediated by four stress-specific eIF2a 
kinases (EIF2AK1-4) that are individually 
responsible for a global reduction of protein 
synthesis and a distinct activating transcrip- 
tion factor 4 (ATF4:)-dependent gene expres- 
sion program (5). For example, accumulation 
of misfolded proteins in the ER lumen (6) 
promotes the dissociation of HSPA5 chaper- 
one protein (also called BIP) from the ER-lu- 
menal domain of PKR-like ER kinase (PERK, 
also called EIF2AK3). This causes PERK to 
oligomerize and phosphorylate eIF2a in the 
cytosol, which reduces secreted protein syn- 
thesis while favoring ATF4 translation and 
initiates the ISR to restore ER homeostasis. 
Abdel-Nour et al. show that intracellular 
bacterial infection promotes the dissociation 
of HSPB8 from the oxidative stress-activated 
kinase heme-regulated inhibitor (HRI, also 
called EIF2AK1), which activates this kinase. 
This leads to increased phosphorylation of 
eIF2a, mirroring PERK activation. The pro- 


The integrated stress response in inflammation 

Stress-dependent release of HSPs, such as HSPB8, from EIF2AKs promotes elF2a phosphorylation and 
subsequent ATF4-dependent gene expression: the ISR. Concomitant microbial triggering of innate immune 
sensors releases HSPB8 from HRI and favors the assembly of SMOCs and downstream pro-inflammatory 
cytokine expression. ISRIB inhibits this pro-inflammatory process by counteracting elF2a phosphorylation. 
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cess of HSP-dependent activation of HRI 
highlights that all members of this kinase 
family, including EIF2AK2 (also called PKR) 
and EIF2AK4 (also called GCN2), are nega- 
tively regulated through cycles of interaction 
with HSP proteins (7). Thus, negative feed- 
back control of activated EIF2AKs by HSPs 
is likely to be a key regulatory component of 
the ISR and potentially other biochemical 
pathways (see the figure). Two protein phos- 
phatase 1 (PP1) cofactors, PP1 regulatory sub- 
units 15A and 15B, are considered the main 
negative regulators of eIF2a phosphoryla- 
tion (8). Investigating the interplay between 
the inhibitory HSPs and these PP1 cofactors 
will therefore be key to further understand- 
ing how the ISR is terminated and how pro- 
tein synthesis returns to equilibrium after 
stress resolution. 

A surprising discovery of Abdel-Nour et ai. 
is that expression of HSPB8 also controls the 
pro-inflammatory response to Salmonella, 
Shigella, Listeria, or Citrobacter infection. 
Cytokine expression is dependent on HRI ac- 
tivation, which directly releases HSPB8 and 
increases ATF4- and ATF3-dependent expres- 
sion, through the phosphorylation of eIF2a. 
This pathway is important for the activation 
of several key innate immunity signaling 
pathways because it promotes the assem- 
bly of signaling adaptors such as receptor- 
interacting protein 2 (RIP2), mitochondrial 
antiviral-signaling protein (MAVS), or TIR 
domain-containing adapter protein inducing 
interferon-8 (TRIF) into protein aggregates 
that constitute supramolecular organizing 
centers (SMOCs, also called signalosomes), 
which transduce signals (9). A discrimina- 
tive feature of TRIF- or MAVS-containing 
SMOCs, compared to other signalosomes 
(10), is their capacity to adopt an alternative 
folding pattern and associate within proto- 
filaments, thereby forming amyloid fibrils. 
These fibrils are emerging as key signaling 
platforms following the detection of micro- 
bial or danger signals (17). However, their ag- 
gregation-prone properties can be cytotoxic 
and trigger inflammation independently of 
microbial stimuli, as demonstrated by their 
association with neurodegenerative patholo- 
gies such as Parkinson’s disease (PD) (72, 13). 
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Assembly and removal of amyloid SMOCs 
should therefore be tightly regulated to avoid 
adverse effects and progression toward un- 
controlled inflammation. HRI and HSPB8 
are likely to play key roles in this regulation. 
However, it remains unexplained how the 
paradoxical situation of increasing the ex- 
pression of specific chaperones to promote 
SMOC assembly and inducing translation of 
pro-inflammatory mediators can occur when 
protein synthesis is concomitantly inhibited 
by the phosphorylation of eIF2a by HRI. 

These observations raise questions about 
the capacity of cells to respond to the for- 
mation of signaling fibrils and the establish- 
ment of an HRI-dependent ISR that could 
induce pro-inflammatory cytokine release 
during sterile (that is, noninfectious) neu- 
rodegeneration. Neuroinflammation is a 
major component of PD progression, char- 
acterized by neuronal cytoplasmic inclusions 
composed mainly of aggregated a-synuclein 
called Lewy bodies (72). In addition to RIP2, 
MAVS, and TRIF, Abdel-Nour e¢ al. showed 
that overexpression of a-synuclein alone 
activated HRI-dependent responses, includ- 
ing expression of HSPB8. Interestingly, the 
small-molecule drug ISRIB, which reverses 
the downstream effects of eIF2a phosphory- 
lation (74), reduced the capacity of infected 
cells to produce pro-inflammatory cytokines, 
mimicking a deficit in HRI or eIF2a phos- 
phorylation. Given that memory consolida- 
tion is inherently limited by ISR activation in 
neurons, ISRIB-treated mice display signifi- 
cant enhancement in spatial and fear-associ- 
ated learning (15) while also being protected 
from protein aggregation-mediated disease. 
The capacity of ISRIB to reduce neuroinflam- 
mation in response to amyloid SMOC signal- 
ing might therefore explain some of these 
pharmacological effects. m 
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A lost wallet returned to its owner serves as a new proxy for the finder’s civic honesty. 


BEHAVIORAL ECONOMICS 


Financial temptation 
increases civic honesty 


Altruism and self-image, not selfishness, 


drive surprising findings 


By Shaul Shalvi 


oes temptation shape dishonesty? 
For example, when a person finds a 
wallet on the street and decides to 
return it to its owner, it may be be- 
cause the contents of the wallet are 
not very tempting or, alternatively, 
because people care about complying with 
norms of good conduct, that is, civic hon- 
esty. Scientists commonly explore such 
questions about human honesty through 
artificial laboratory tasks, but such stud- 
ies have not provided conclusive evidence 
about the extent to which people are honest 
in natural circumstances. On page 70 of this 
issue, Cohn e¢ al. (1) describe a field experi- 
ment involving 17,000 people in 40 coun- 
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tries to provide a new measure of honesty. 
The results show just how prevalent civic 
honesty is, and they raise many questions, 
such as how environments can be designed 
to foster civic honesty. 

Cohn et al’s work is a prime example of 
the theoretical models that emerge when 
economics and psychology interact. Tradi- 
tionally, economic models have assumed 
that humans act as rational economic 
agents who seek to serve their self-interest 
to increase material gain. To better predict 
human behavior, behavioral economic mod- 
els incorporate nonmaterial utilities, such 
as altruistically caring for others’ outcomes 
(2) or seeking to maintain a positive self- 
image (3, 4). To test these models, behav- 
ioral scientists often engage participants in 
tasks that enable them to be honest or in- 
stead to lie in order to increase their pay. For 
example, in a task that rewards participants 
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according to their self-reported outcome of 
a private rolling of a die, participants, on 
average, report higher numbers than would 
be expected if they were honest, indicating 
that people lie, but only to a modest extent 
(5, 6). Because participants’ die rolls are pri- 
vate, the modest amount of lying suggests 
that people are concerned with maintaining 
a positive self-image. 

Such laboratory tasks are limited by 
their artificial nature and by the fact that 
it is unclear to participants who would 
benefit from their honesty or suffer from 
their dishonesty. The question is whether 
such settings predict honesty outside the 
lab, where dishonesty often harms oth- 
ers. Recent work suggests that lab tasks 
do fairly well, revealing a positive corre- 
lation between participants’ reported die 
roll numbers (as a proxy for dishonesty) 
and their tendency to free-ride on public 
transport (7) or to not return undeserved 
payment (8). Furthermore, in countries 
with high levels of corruption, participants 
report higher die roll numbers (9). What is 
still missing, however, is a direct measure 
for how prevalent honesty is in a natural 
setting, where people face varying degrees 
of temptation and have a clear idea of the 
person or group that benefits from honesty 
and is harmed by dishonesty. 

To disentangle the influence of self- 
interest, self-image, and altruism in a field 
experiment, Cohn et al. handed wallets to 
front-desk employees at major institutions, 
including banks, theaters, and other public 
offices, claiming to have found them on the 
street. Each wallet contained a grocery list, 
a key, and three identical business cards, 
providing people with a way to show civic 
honesty by returning the wallet to its owner. 
To vary temptation, different amounts of 
money were also included in some of the 
wallets. Cohn e¢ al. conducted many control 
tests and analyses to rule out the possibility 
that people would return the wallet out of 
fear of being identified or punished. 

Whereas selfishness predicts that people 
will be less likely to return wallets con- 
taining money, altruism and the desire to 
maintain a positive self-image predict the 
opposite pattern. In 38 of the 40 countries 
studied, wallets with money were returned 
more often than wallets without money, 
which supports the idea that people are 
not purely selfish. Moreover, wallets with 
more money (US$94.15) were more likely 
to be returned than wallets with less 
money (US$13.45). The effect not only con- 
tradicts rational economic thinking, it is 
rather surprising. Both laypeople and ex- 
pert economists predicted the exact oppo- 
site pattern of results in surveys reported 
by the authors. 
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Furthermore, a key is valuable to the wal- 
let’s owner, not the wallet’s finder. In the 
United States, the United Kingdom, and 
Poland, the authors added an experimental 
treatment in which wallets included money 
but no key. Doing so allowed them to as- 
sess the specific contribution of altruism 
to honesty. Indeed, adding a key increased 
the likelihood of the wallet being returned. 
Taken together, these results support the 
idea that people care about others as well as 
caring about being honest. 

Cohn et al’s study provides a new way 
to assess human honesty. The work evokes 
numerous questions. By having the grocery 
list and business cards written in the local 
language, the authors identified the wal- 
let owner as local. Our globalized world, 
however, is diverse. People of different 
ethnicities, backgrounds, and nationalities 
interact. Prosocial and honest behaviors 
are often parochial (J0-12); people find it 
worthwhile to act kindly toward members 
of their own group but not members of 
other groups. The results reported by Cohn 
et al. may thus scale down when civic hon- 
esty is expressed toward nonlocals, includ- 
ing tourists or immigrants. 

Also, showing civic honesty does not just 
mean returning a lost wallet. It is about act- 
ing in a socially desirable way, against one’s 
selfish interests. Understanding when peo- 
ple are likely to engage in civic honesty— 
such as by whistleblowing in response to 
suspected organizational wrongdoing, or by 
obeying rules even when exposed to others’ 
corrupt behavior—is important. By continu- 
ing to push the methodological boundaries 
and the proxies used to assess sensitive be- 
haviors such as honesty, researchers aspire 
to figure out how to better design our envi- 
ronments and organizations to foster such 
desired behaviors. 
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LIGHT METALS 


Processing 
magnesium 
at room 
temperature 


In situ microscopy 
experiments show how 
pyramidal slip increases 
magnesium’s ductility 


By Gwénaélle Proust 


luminum (Al) alloys are often used 
for lightweight applications such 
as airframes, but with proper alloy- 
ing elements and heat treatments, 
magnesium (Mg) alloys can reach 
strengths comparable to those of 
some Al alloys but be 35% lighter (7). None- 
theless, the use of Mg alloys has been lim- 
ited by their poor processability at room 
temperature and low corrosion resistance 


“The low ductility of Mg 
alloys is associated with 
their hexagonal 
crystalline structure.” 


(2). The latter can be improved either with 
additional alloying elements (2) or by refin- 
ing their grain size (3). Improvements to the 
ductility of Mg, which controls its process- 
ability, will require better understanding of 
its plastic deformation mechanisms and the 
parameters controlling their activities. On 
page 73 of this issue, Liu et al. (4) offer new 
insight into how Mg deforms, and more 
specifically, on the role of slip along the py- 
ramidal planes of its unit cell in accommo- 
dating large deformation (see the figure). 
The potential for energy savings with 
lighter alloys is especially appreciated by the 
transportation industry (5). In recent years, 
the Volkswagen group has used around 14 
kg of Mg in the Audi A4 and A6, and Gen- 
eral Motors has used 26 kg in the GMC 
Savana (6). The forming of wrought Mg al- 
loys—shaping them by rolling, extruding, 
or forging—is costly and time consuming 
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because of the low deformation 
capability of these alloys at low 
temperatures. Shaping processes 
at room temperature require re- 
peated cycles of small deforma- 
tion and thermal annealing to 
prevent cracking. The production 
of bars, rods, sheets and plates at 
high temperatures can avoid low- 
temperature multiple-stage pro- 
cessing (7), but high-temperature 
deformation methods come with a 
substantial energy penalty. 
Improving the ductility of Mg 
alloys at low temperatures would 
not only decrease the cost associ- 
ated with their production but also 
would create microstructures in the 
alloy that could potentially improve 
their performance for structural ap- 
plications. The low ductility of Mg 
alloys is associated with their hex- 
agonal crystalline structure. Most 
other engineering metals are more 
ductile because they have a cubic 


crystalline structure that allows Pyramidal slip Pyramidal slip 
the dislocations that accommo- _ Plane type! plane type Il 
date plastic deformation to move (1011) (1122) 


more readily. The structure in Mg 

has only limited ways for disloca- 

tions to easily glide, so deformation 

must be accommodated by other 
mechanisms, such as_ twinning, 

grain boundary sliding, and recrystallization 
(see the figure). The diversity of the deforma- 
tion mechanisms available results in the high 
plastic anisotropy of Mg, which limits its 
ability to accommodate deformation in the 
directions normal to the loading direction. 
The deformation mechanisms that operate 
depend on processing parameters such as 
temperature, loading direction, deformation 
rate, and grain size (8, 9). 

At low temperatures, some of these 
mechanisms are not available, which results 
in cracking. Different approaches have been 
proposed to improve the ductility of Mg 
at room temperatures, including grain re- 
finement (10), modification of texture (the 
distribution of the various crystallographic 
orientations present in the material) (77), 
and alloying (72). All these approaches aim 
to increase the activity of nonbasal slip 
(basal slip being the most favorable slip 
mode in Mg and its alloys); more specifi- 
cally, to accommodate deformation along 
the c axis of the Mg hexagonal unit cell, (c + 
a) dislocations (ones that move in the direc- 
tion in the unit cells that combines the (c) 
and (a) unit vectors) should glide on pyra- 
midal planes. 

There are actually two types of pyrami- 
dal planes in Mg: type I in the (1011) plane 
and type II in the (1122) plane. Because 
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Mapping magnesium deformation 
Atoms in magnesium accommodate strain through several mechanisms 
when the metal is deformed. Liu et a/. studied one mechanism, slip along 

pyramidal planes, with electron microscopy while the metal was strained. 
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Packing atoms 
Two families of pyramidal planes can accommodate deformation 
in this hexagonal close-packed metal. 


these two types of planes have different 
atomic densities, the energy required to 
move dislocations on them is different. To 
date, no consensus has been reached on 
which of these types of planes is prevalent 
during Mg deformation (73). The other dif- 
ficulties come from studies showing con- 
tradictory results regarding the mobility 
of the (c + a) dislocations in pure Mg. Ah- 
mad et al. stated that (c + a) dislocations 
are essentially sessile Gmmobile) but that 
this problem can be overcome with proper 
alloying (14). Kumar et al. showed that al- 
though (c + a) dislocations on pyramidal 
type I planes become sessile, the ones on 
pyramidal type II planes remain glissile 
under stress (15). 

Liu et al. present results related to both 
controversies. They used advanced in situ 
transmission electron microscopy (TEM) 
mechanical testing techniques to apply a 
compressive load at room temperature on 
Mg micropillars fabricated from single crys- 
tals. Both pyramidal type I and pyramidal 
type II planes participate to plastically de- 
form Mg. Their recording of the visualized 
deformation synchronized with the me- 
chanical response of the material showed 
unequivocally the relation between strain 
accommodation and the formation and mo- 
tion of (c + a) dislocations. They also used 
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Accommodating strain 
~ Changes in microstruc- 
ture characterized with 
electron backscatter 
diffraction that resulted 
from straining a 
magnesium alloy are 
shown as changes in 
* color and smaller grains. 
Three different 
~ mechanisms—slip, 
twinning, and recrystalli- 
“, zation—simultaneously 
* accommodate strain in 
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three-dimensional reconstruction 
techniques on the TEM images 
captured at different tilting angles 
to identify the planes (pyramidal 
type I or II) on which different 
dislocations are lying and to even 
identify dislocation cross-slip. The 
glissile character of the disloca- 
tions is confirmed on both types 
of pyramidal planes by molecular 
dynamics simulations in which (c 
+ a) dislocations nucleate because 
of compressive loading along the 
c axis, instead of by being artifi- 
cially introduced in the model. 
The reversibility of dislocation 
motion is demonstrated through 
cyclic loading observed during in 
situ TEM testing. 

The findings of Liu et al. rein- 
force the idea that promoting (c + 
a) dislocation activity in Mg can 
increase its processability. Using 
the proposed molecular dynamics 
model, it will become possible to 
identify microstructures that are 
favorable to increase ductility in 
Mg and its alloys at room tem- 
perature. The combined experi- 
mental and modeling approach 
used in this study can be repeated 
on different Mg alloys to confirm 
and better understand the role 
of alloying elements in the ductility of Mg, 
which will allow the development of new 
wrought Mg alloys. = 
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NEUROSCIENCE 


Targeting microglia in brain disorders 


The heterogeneity of brain immune cells could be exploited therapeutically 


By Josef Priller'?* and Marco Prinz*>* 


hen the neuroscientist Pio del Rio 

Hortega first characterized “mi- 

croglia” in 1919 using silver carbon- 

ate staining of animal and human 

brain tissues, he was impressed by 

the morphological plasticity and 
capacity to phagocytose (ingest) of the resi- 
dent immune effector cells, called microglia 
(1). A century later, technological advances in 
molecular biology, imaging, and single-cell 
analysis have provided fascinating insights 
into the dynamic changes of microglia in re- 
sponse to aging and brain diseases, revealing 
their potential as therapeutic targets. 

Microglia are specialized tissue-resident 
macrophages in the central nervous system 
(CNS). They develop from immature yolk sac 
progenitor cells during early embryogenesis 
and persist throughout life. Microglia rely 
on cytokine signaling, such as activation of 
colony-stimulating factor 1 receptor (CSF1R) 
and transforming growth factor-B1 (TGF-BD, 
for their survival. Notably, heterozygous loss- 
of-function mutations in the human CSFIR 
gene cause an autosomal-dominant pro- 
gressive degenerative white-matter disorder 
called adult-onset leukoencephalopathy with 
axonal spheroids and pigmented glia (ALSP), 
and homozygous CSFIR mutations cause 
brain malformations with almost complete 
absence of human microglia (2). A somatic 
BRAF-Val®°Glu mutation in human microg- 
lia results in late-onset neurodegeneration 
in individuals with histiocytosis (excess of 
macrophages) (3). This suggests that microg- 
lial dysfunction can be the primary cause of 
neurological diseases in humans. 

Microglia can also promote neurodegen- 
eration secondary to infections (e.g., HIV 
infection), exposure to misfolded proteins 
(as occurs in many neurodegenerative dis- 
orders), or CNS injury. Whereas microglia 
were traditionally considered to quickly re- 
spond to any pathological event in the brain 
and to help restore homeostasis (albeit at the 
price of potential bystander damage, such as 
through the release of toxic molecules or pro- 
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inflammatory mediators), it is now becoming 
clear that this passive role of microglia in re- 
sponding to damage falls short in reflecting 
their complex functions in the CNS. Indeed, 
microglia may themselves initiate neural 
dysfunction and neurodegeneration through 
loss of homeostatic and/or gain of aberrant 
function (see the figure). Consistently, many 
genetic alterations that are associated with 
increased risk for neurological and psychiat- 
ric disorders, including schizophrenia, autism 
spectrum disorder, frontotemporal dementia, 
and Alzheimer’s disease (AD), affect genes 
that are highly expressed by microglia. As a 
result, microglial functions have emerged as 
possible targets for CNS disorders. 

Given their embryonic origin and depen- 
dence on self-maintenance in the absence of 
turnover with circulating monocytes, microg- 
lia may be much more vulnerable to aging 
and degeneration than previously assumed. 
Recent transcriptomic studies have revealed 
substantial age-dependent changes in the 
expression of genes involved in cell motil- 
ity, adhesion, axonal guidance, and immune 
function in microglia, reminiscent of the 
changes observed in some neurodegenerative 
diseases (4). The sialic acid-binding receptor 
CD22 is up-regulated on aged microglia, and 
CD22 blockade restores microglial homeosta- 
sis and improves cognitive function in aged 
mice (5), underscoring the therapeutic poten- 
tial of targeting microglial functions. A dis- 
ease-associated microglia state was recently 
identified in the brains of AD transgenic mice 
that may exert protective functions by clear- 
ing amyloid-B protein aggregates, which are 
thought to be pathological in AD (6). These 
neurodegeneration-associated microglia ex- 
press many of the genes that have been as- 
sociated with increased risk for AD and other 
neurodegenerative disorders in genome-wide 
association studies, including cathepsin D 
(CTSD), lipoprotein lipase precursor (LPL), 
TYRO protein tyrosine kinase-binding pro- 
tein (TYROBP), apolipoprotein E (APOE), and 
triggering receptor expressed on the myeloid 
cells 2 (TREM2). Single-cell transcriptomic 
analysis of the prefrontal cortex in humans 
with AD pathology confirmed the presence 
of disease-associated microglia signatures 
(7). Among the most highly expressed genes 
in microglia were secreted phosphoprotein 
1 (SPPI), protein tyrosine phosphatase re- 
ceptor type G (PTPRG), apolipoprotein Cl 
(APOCI), and APOE, all of which have been 
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associated with increased risk of AD. 
Disease-associated microglia states were 
also discovered in the brains of humans 
with multiple sclerosis (MS), including 
SPP1* microglia and CTSD* microglia (8). 
Their transcriptional profiles resembled 
those of microglia phenotypes identified in 
a mouse model of demyelination, which is 
a hallmark of MS pathology (8). In contrast 
to neurodegeneration-associated microglia, 
increased expression of genes involved in 
interactions with other immune cells, in- 
cluding CD74, and the major histocompat- 
ibility complex (MHC) class II molecules, 
was detected in MS-associated microglia. 
This may reflect the chronic neuroinflam- 
mation that is observed in MS. 
Technological advances in single-cell mass 
cytometry have enabled the validation of 
some of the transcriptional signatures of dis- 
ease-associated microglia at the protein level 
(single-cell proteomics) (9). These studies 
detected differences in the immunopheno- 
typic signatures of microglia between mouse 
models of neurodegeneration and neuro- 
inflammation; whereas the former showed 
up-regulation of CD14 in a small subset of 
microglia, the latter were characterized by 
the global up-regulation of MHC class II and 
stem cell antigen 1 (SCA1) in microglia. Fu- 
ture lineage-tracing studies will help to deter- 
mine whether disease-associated microglia 
are a distinct, albeit heterogenous popula- 
tion, a mixture of different subpopulations, 
or a reflection of dynamic states of microglia. 
From a therapeutic perspective, it would 
seem attractive to specifically target these 
disease-associated microglia phenotypes 
based on the molecular signatures according 
to the type of neuropathology. The modula- 
tion of checkpoint mechanisms that normally 
keep microglia in a homeostatic state and 
are down-regulated during the develop- 
ment of disease-associated microglia, such 
as the receptors CX3C chemokine receptor 1 
(CX3CR1), P2Y purinoceptor 12 (P2Y12). and 
TGF-81 receptor (TGFBR1), as well as the 
TREM2-APOE pathway, appears particu- 
larly promising (6). By contrast, long-term 
depletion of microglia or administration of 
immunosuppressant drugs may not be the 
preferred treatment choice for neuropsychi- 
atric disorders due to the resulting loss of 
homeostatic functions of microglia, such as 
the modulation of synaptic plasticity, the sup- 
ply of trophic factors for neurons and other 
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types of glia, and the participation in patho- 
gen defense. 

Microglia are highly dependent on envi- 
ronmental signals to maintain their homeo- 
static phenotype. This results in distinct 
brain region-dependent transcriptional iden- 
tities, which are associated with differential 
vulnerability to aging (4). Immune profiling 
of human brain microglia by single-cell pro- 
teomics recently revealed a core immunophe- 
notypic signature with considerable regional 
heterogeneity (70). Notably, microglia subsets 
in the frontal and temporal cortex expressed 
higher amounts of mannose receptor C type 1 
(CD206) compared to microglia subsets in 
the subventricular zone and thalamus that 
expressed higher levels of the proliferation 
markers cyclin A, cyclin Bl, and Ki67. These 
findings have important thera- 
peutic implications because mi- 
croglia may be targeted not only 
in a disease-specific, but also in 
a brain region-specific, manner. 
This refined approach could fa- 
cilitate the treatment of CNS dis- 
orders with regionally confined 
pathology, such as stroke or 
Parkinson’s disease, by enabling 
site-specific modulation of mi- 
croglial function that spares 
nonaffected tissue. 

Unfortunately, microglia can- 
not be easily accessed in the CNS. 
The blood-brain barrier, which 
is formed by tight junctions be- 
tween endothelial cells in cere- 
bral blood vessels, regulates the 
trafficking of cells and larger 
molecules into the CNS. As an 
exception to the rule, the choroid 
plexuses and circumventricular 
organs contain fenestrated blood vessels that 
facilitate exchange processes with the periph- 
ery. The therapeutic relevance of these gates 
into the CNS from peripheral blood has yet 
to be fully explored. Initial attempts to target 
genetically engineered hematopoietic cells 
injected into the blood to sites of damage in 
the brain in order to replace microglia relied 
on CNS irradiation (J7). In two landmark 
clinical studies on inherited peroxisomal 
and lysosomal storage disorders (X-linked 
adrenoleukodystrophy and metachromatic 
leukodystrophy) that result in severe motor 
and cognitive deficits, children were treated 
successfully with chemotherapy, which also 
conditions the CNS for immune cell engraft- 
ment. This was followed by reinfusion of he- 
matopoietic stem cells that were genetically 
corrected ex vivo with lentiviral vectors, re- 
sulting in expression of the deficient proteins 
in the CNS (22, 13). 

Dysfunctional microglia can also be de- 
pleted locally or throughout the brain with 
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pharmacological inhibitors of CSF1R signal- 
ing or with liposome-encapsulated bisphos- 
phonates that trigger apoptosis specifically 
of microglia. As the microglial population is 
under strong homeostatic pressures, rapid 
replenishment occurs from local pools (/4). 
The newly engrafted cells temporarily ac- 
quire functions that are distinct from those of 
preexisting dysfunctional microglia (14). The 
process of replenishment also provides an 
opportunity to introduce gene-modified cells. 
Competition of myeloid cells (such as mono- 
cytes and macrophages) for niches within the 
CNS that have been generated by the loss or 
dysfunction of microglia can affect the effi- 
ciency of replacement with engineered cells. 
The factors that govern competitive engraft- 
ment are not fully understood and have yet to 


Microglial heterogeneity 
Regional heterogeneity of microglia can be observed during homeostasis. 

During pathology, disease-associated microglia emerge with distinct transcriptional 
profiles that reflect specific activation states. 


target disease-associated microglia states 
without affecting homeostatic microglia or 
peripheral tissue macrophage populations. 
These compounds would need to cross the 
blood-brain barrier to gain access to the 
brain parenchyma, and in the case of gene 
engineering, efficient vectors for human 
microglia need to be available that do not 
trigger inflammation. The protocols for the 
engraftment of genetically engineered mye- 
loid cells in the human CNS also need to be 
optimized. Interventions may need to be 
temporally restricted if disease-associated 
microglia are dynamic states. Thus, the de- 
velopment of new radioligands for positron 
emission tomography (PET)-based moni- 
toring of microglial activity in the human 
brain, e.g., isotope-labeled ligands for P2Y12 
or TREM2 receptors, will be of 
importance. Dynamic modu- 
lation of microglial function 
(activation and inhibition) to ac- 
count for the spatiotemporally 
differential roles of disease- 
associated microglial states, 


“~™» Cross-talk and for the complex interac- 
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% Damaged neuron Constitutive regional immune cells, such as border- 
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be exploited therapeutically. Aged microglia 
or microglia carrying genetic and/or epigen- 
etic defects may have a competitive disad- 
vantage. Thus, future attempts to modify the 
function of microglia or microglia subpopula- 
tions, e.g., by gene editing, on a whole-brain 
or region-specific level need to consider mi- 
croglia population dynamics. Along these 
lines, it is important to note that somatic 
mosaicism (cell clones with distinct acquired 
mutations) in microglia plays a role in CNS 
disorders, such as histiocytosis and ALSP (3, 
15). The selection of the (regionally) “fittest” 
microglia may be exploited for therapeutic 
purposes, e.g., gene-edited microglia may 
outcompete microglia with disease-causing 
mutations, and microglia may be engineered 
that maintain homeostatic functions or resist 
toxic microenvironments. 

Many obstacles need to be overcome 
before microglia can be exploited for the 
therapy of CNS disorders. It is likely to be 
important to develop drugs that specifically 
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has substantially increased in 
recent years and may eventually 
allow us to modulate microglial 
function indirectly and without ingress into 
the CNS. Given the lack of effective thera- 
peutic options for many neurological and 
psychiatric diseases, further understanding 
and eventual targeting of microglial func- 
tions in the CNS could hold promise. 
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Regulating genetic biohacking 


Emphasize community engagement, not perfect compliance 


By Patricia J. Zettler,! Christi J. Guerrini,” 
Jacob S. Sherkow*** 


ust as the popularization of comput- 

ers in the late 1970s and early 1980s 

gave rise to computer hacking, the 

recent accessibility and affordabil- 

ity of relatively easy (and widely 

hyped) genome-editing technologies 
and resources has spurred interest in ge- 
netic “biohacking”—molecular genetics ex- 
periments performed outside institutional 
laboratories by individuals who may have 
little formal scientific training. Regulation 
of the work of professional scientists and 
traditional scientific institutions is robust, 
although it still faces scrutiny in the wake 
of He Jiankui’s genome-editing experi- 
ments on Chinese twins (J). However, regu- 
lation of genetic biohacking has received 
far less attention, even though, like tradi- 
tional scientific research, it is likely to pro- 
duce a range of innovations while posing a 
number of risks to public health. Here, we 
explore these risks and the consequences 
of understanding that some instances of 
regulatory failure for biohacking are inevi- 
table. And, where they are not, we suggest 
that agencies, policy-makers, and private 
parties have the opportunity to improve 


oversight of genetic biohacking using the 
tools they currently possess. 


GENETIC BIOHACKING 
Experiments to modify genetic expression 
that once required specialized training and 
substantial investments in equipment and 
reagents can now be conducted for a few 
hundred dollars and with a basic instruc- 
tion manual. Genomic sequencing can be 
done using portable pocket devices, some of 
which cost less than a plane ticket. The rise 
of direct-to-consumer genetic testing has 
also resulted in individual access to raw ge- 
netic data, fueling a variety of health, well- 
ness, ancestry, and relative identification 
services that offer to interpret those data. 

As these technologies go mainstream, 
some individuals have begun conducting 
genetic experiments outside of traditional 
scientific labs, such as those associated with 
universities, research institutions, and reg- 
ulated corporate entities. Some of these ex- 
periments have involved humans, although 
thus far they appear to be limited to self- 
experimentation with one’s own body—an 
activity that has an ancient pedigree in tra- 
ditional medical research. 

The motivations of these genetic biohack- 
ers, some of whom lack any formal training 


in biology, are diverse and often complex. 
Some appear to be motivated by norma- 
tive beliefs in a “right to do science.” Others 
place a high value on bodily autonomy or 
creative expression—a right to experiment 
on themselves or use genome editing for 
expressive purposes. Some view biohack- 
ing as a means of self-care, where, for ex- 
ample, they experiment with alternatives 
to (sometimes expensive) regulated drugs. 
Still others harbor views that traditional 
scientific institutions are poor regulators of 
themselves or are slow and needlessly cum- 
bersome. And some, frankly, are moved by 
anti-government sentiments. 

Reflecting these diverse motives, recent 
reports of genetic biohacking include a 
broad array of experiments: genetic modifi- 
cation of bacteria, yeast, plants, nonhuman 
mammals—and also humans, in the form of 
genetic self-experimentation. This includes, 
for example, self-injecting homemade ge- 
netic material in attempts to change the 
expression of muscle growth factors to 
improve strength or to treat diseases such 
as HIV or herpes (2). Where self-experi- 
mentation is undertaken by groups that 
coordinate their efforts, these activities can 
begin to look like decentralized clinical tri- 
als. Some biohackers might also attempt to 
experiment on others. Although there are 
no documented instances of this to date, 
biohackers have reported (and expressed 
concerns about) being approached by indi- 
viduals asking for help treating their own 
or their family members’ health conditions. 
Genetic biohacking of this sort—experimen- 
tation on oneself and others—poses public 
health risks. These include interventions 
with poor safety or efficacy, a lack of true 
informed consent, and the introduction and 
uptake of unsafe and unproven “therapies” 
into commerce. The democratization of 
genetic biohacking exacerbates these pub- 
lic health risks because many experiments 
make use of easy-to-obtain materials and 
equipment purchased from companies that 
cater to the do-it-yourself (DIY) market or 
freely provided by other biohackers. 

There are other emerging areas of DIY 
science, such as neurohacking and self- 
manufacture of traditional pharmaceutical 
products, that do not focus on molecular ge- 
netics but that similarly raise public health 
concerns. Genetic biohacking, however, is 
especially easy, affordable, and a particu- 
larly popular and promising form of DIY 
science that poses unclear but potentially 
serious or far-reaching risks. These include, 
for example, sick individuals foregoing 
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known, effective treatments in the hope of 
cheaper and unproven DIY genetic inter- 
ventions or, at an extreme, harmful germ- 
line modifications. As has been the case for 
“alternative” cancer treatments and autolo- 
gous stem cell transplants, hype and access 
to biological reagents have the potential to 
pose substantial public health concerns. Ge- 
netic biohacking communities, therefore, 
should be an important focus for regulating 
DIY science. 


GOVERNMENT REGULATION 

As biohacking has become more prevalent 
and public, scholars, ethicists, and regula- 
tors have voiced concerns that government 
oversight may be absent or inadequate to 
address the risks that these activities may 
pose. Indeed, biohackers sometimes work 
in private, whereas traditional research is 
conducted in teams overseen by institu- 
tions. Biohackers generally do not obtain 
ethical review of their work, in contrast 
to traditional biological research. Further- 
more, biohackers are often self-funded and 
are thus not typically accountable to private 
or agency funders, unlike their traditional, 
professional counterparts. 

Contrary to the conventional wisdom, 
however, genetic biohacking does not oc- 
cur in a legal or ethical “wild west” (J). 
In the United States, there are a number 
of laws and regulations that appear to be 
relevant. We focus on the U.S. regulatory 
landscape because the United States is a 
popular site for genetic biohacking and the 
home of the earliest community laborato- 
ries. Unlike some European countries, the 
United States does not ban genome editing 
conducted outside of licensed laboratories, 
although it is not unlikely that such a ban 
would be proposed if it is discovered (as it 
was with He) that some genetic biohackers 
have crossed generally observed lines of 
ethics or safety. Our objective is to help U.S. 
regulators better prepare for that day. Al- 
though our recommendations may not pre- 
cisely translate to other countries because 
each jurisdiction has a unique regulatory 
system and philosophy, they may nonethe- 
less be broadly informative of potential 
regulatory responses. 

The U.S. Food and Drug Administration 
(FDA), for example, has extensive power 
to regulate the public health impacts of 
genetic biohacking, with jurisdiction that 
reaches farther than many biohackers re- 
alize. In many circumstances, the things 
used by genetic biohackers—such as raw 
biological materials, traditional drug prod- 
ucts, and DIY CRISPR kits—are, by stat- 
ute, FDA-regulatable drugs. Other articles, 
such as human gene therapy products, also 
come within FDA’s purview over biologics 
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(3). Moreover—and contrary to popular be- 
lief—money need not always change hands 
for an item to fall within FDA’s jurisdiction, 
a wrinkle important for biohackers who 
freely provide or exchange materials (4). 
This view of FDA’s authority was bolstered 
in November 2017 when, in response to con- 
cerns about individuals using DIY CRISPR 
kits for self-experimentation, the agency 
stated that “any use of CRISPR/Cas9 gene 
editing in humans [is] gene therapy” and 
therefore subject to regulation (5). 

FDA has yet to vigorously enforce its 
power in this area. To date, it has not taken 
public enforcement action against any bio- 
hackers conducting genome editing. But 
this is consistent with FDA’s flexibility to 
exercise “enforcement discretion” in decid- 
ing which violations merit formal action 
given limited enforcement resources. Ge- 
netic biohacking may also make it practi- 
cally difficult for FDA to identify violations 
that do occur, especially when committed 
by individual experimenters or small-scale 
biohacking communities. 

Even so, this does not mean that new or 
more powerful regulations are warranted. 
Where FDA has chosen not to formally 
wield its enforcement power, the agency 
still has a role in community engagement— 
education, warning, and standard-setting 
for activities that pose public health risks 
and that otherwise fall within its purview. 
An important function of the agency is to 
encourage communication and disclosure 
for traditional, commercial research (6). 
Through its longstanding role in assessing 
drugs and biological products, FDA is the 
government regulatory agency equipped 
with the expertise to assess the safety and 
effectiveness of genetic biohacking. FDA 
involvement, therefore, may help to realize 
the promise of genetic biohacking through 
guiding biohacking efforts toward interven- 
tions that live up to the communities’ hopes. 

Although FDA has begun to show inter- 
est in genetic biohacking—as evidenced by 
its November 2017 statement and the par- 
ticipation of officials in a 2018 “bio-citizen” 
workshop hosted not by the agency but 
by the Woodrow Wilson Center (5, 7)—the 
agency has seemingly not yet taken the 
reins through a proactive effort to deeply 
engage with or understand biohacking 
communities. Given some biohackers’ con- 
tinued confusion about FDA’s authority 
over their work, the agency might begin by 
clarifying the boundaries of its jurisdiction, 
in lay terms and in sufficient detail to cover 
diverse biohacking activities, while seeking 
feedback from biohacking communities on 
how FDA could best exercise its authority in 
this space. This would provide biohacking 
communities more certainty about where 
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they stand and potentially encourage new 
and innovative biohacking activities that 
might have been deterred by uncertainty 
about FDA enforcement. At the same time, 
these activities will help the agency to avoid 
repeating the mistakes it made with the 
stem cell industry, where the rapid expan- 
sion of clinics offering unproven interven- 
tions to patients is attributed to years of 
uncertainty around the scope of FDA ju- 
risdiction and limited agency engagement. 
The agency also could draft guidances about 
typical genetic biohacking experiments and 
identify staff as points of contact for those 
engaged in genetic biohacking who would 
like to communicate with the agency. FDA’s 
lack of current engagement is a shame, but 
not one that merits revamping of the agen- 
cy’s powers. 

A similar approach has thus far proved 
successful for other federal authorities in 
different contexts. The risks posed by bio- 
hacking in the context of “bioterrorism,” for 
example, have been the subject of study by 
the Federal Bureau of Investigation’s (FBI) 
Biological Countermeasures Unit (8). To 
police the threat of biohacking-mediated 
bioterrorism, and in contrast to FDA’s work, 
the FBI has developed strong relationships 
with community labs, where some genetic 
experimentation is occurring. 

Efforts at community engagement should 
focus on the potential public health harms 
posed by genetic biohacking, such as ad- 
verse effects from the administration of 
homemade gene therapies, contamination 
of the environment from poorly kept ge- 
netic reagents (such as viral vectors), and 
the forgoing of traditional treatments in 
favor of DIY experimental ones. Specific 
risks (and potential benefits) of genetic bio- 
hacking involving humans will depend on 
the context of their use. Thus, assessment 
should include, for example, ascertaining 
whether a homemade genetic intervention 
is intended as a therapy for a disease with 
no known treatment, a disease for which 
there are known effective treatments, or for 
some other purpose, such as an enhance- 
ment or aesthetic use. 


PRIVATE REGULATION 

Genetic biohacking is also potentially sub- 
ject to U.S. laws that are enforced by pri- 
vate rather than government actors. These 
may fill some of the gaps in public regula- 
tors’ ambit (9). Patent owners, for example, 
can impose ethical restrictions on licens- 
ees, such as the Broad Institute’s licenses 
for its CRISPR patents to Bayer (formerly 
Monsanto), with conditions that Bayer 
avoid research activities that are potentially 
harmful to public health, including tobacco 
research and germline editing (10). Such li- 
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cense restrictions can—and should—be used 
to police commercial manufacturers of ge- 
nome-editing kits and reagents popular in 
biohacking communities, just as they have 
previously been used to prevent activities 
that pose national security, environmen- 
tal, or public health risks (17). Even with- 
out a license in place, patent owners can 
enforce restrictions through threats of pat- 
ent infringement litigation against any re- 
calcitrant biohackers or manufacturers of 
biohacking products. A similar model was 
proposed as an attempt to restrict the use 
of “gene drive technology’—inheritable ver- 
sions of CRISPR designed to drive a specific 
allele through generations of a population 
(12). Beyond patents, people injured by ge- 
netic biohacking materials could potentially 
bring tort law claims against biohackers 
and component suppliers to seek compen- 
sation for their injuries. A person injured 
while using a DIY CRISPR kit, for example, 
would likely be able to sue the seller of the 
kit—a potentially strong deterrent to mar- 
keters of unsafe biohacking materials. 

Apart from these legal mechanisms, some 
biohacking communities have adopted their 
own ethics restrictions, which, even if not 
intended to do so, might indirectly avoid 
harms to public health caused by genetic 
biohacking. The International Genetically 
Engineered Machine (iGEM) Competition, 
for example, requires its participants to 
comport with a strict program of bioethics 
(13). The International Gene Synthesis Con- 
sortium—a group of commercial suppliers 
of genetic materials—developed protocols 
for screening orders and verifying custom- 
ers in an effort to prevent dangerous uses. 
For example, protocols may instruct suppli- 
ers to decline orders for delivery to home 
addresses or post office boxes (9). Although 
this is an important effort, some biohackers 
have nonetheless devised ways to pass such 
screening by, for instance, registering busi- 
nesses for the purpose of obtaining institu- 
tional addresses. 

Another example of self-governance is 
community labs’ adoption of safety poli- 
cies, which often include standards detailed 
in the cornerstone of biosafety practices 
in the United States, the Biosafety in Mi- 
crobiological and Biomedical Laboratories 
guidance document. These policies include 
restrictions on experimentation in humans, 
one of the riskiest forms of genetic biohack- 
ing with the largest potential negative con- 
sequence to public health. A grant-funded 
effort spearheaded by North Carolina State 
University is currently under way to un- 
derstand and improve on community labs’ 
safety policies with guidance from biosafety 
officers established in three labs. Analo- 
gously, a code of ethics adopted in 2011 by 
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an organization of DIY biologists, DIYbio. 
org, remains an important touchstone for 
experiments (14). 

Given that many biohackers who conduct 
work at home are also members of com- 
munity labs (15), their safety policies have 
the potential to go a long way in promoting 
safety in genetic biohacking. Outside the 
norms of community labs and traditional 
scientific institutions, many who engage 
in biohacking activities nonetheless rely 
on each other for materials and informa- 
tion, providing a positive downstream ef- 
fect to community labs’ ability to police the 
conduct of biohacking communities. These 
collaborations might also encourage trans- 
parency between biohackers affiliated with 
community labs and those outside the com- 
munity lab niche. 

Like government regulation, private gov- 
ernance of this sort is important and laud- 
able but not a perfect or comprehensive 
solution. Private actors may not be inclined 
to regulate conduct that poses few risks 
to them, even if safety risks to others are 
numerous, obvious, and serious. In other 
cases, the social stigma of violating com- 
munity norms may simply be an ineffec- 
tive deterrent. Additionally, community 
labs’ private governance efforts only weakly 
reach genetic biohacking communities fo- 
cused on human experimentation or in lo- 
cations where community labs are absent. 


MOVING FORWARD 

The existence of public and private gover- 
nance mechanisms to mitigate the public 
health risks and encourage the innovative 
potential of biohacking—even if currently 
infrequently used—means that regulators 
can better implement these mechanisms 
rather than rely on new ones to be grafted 
into law. For example, calls for bans on bio- 
hacking, such as those from a consumer ad- 
vocacy organization in Australia, go too far. 
The tools for public and private regulators 
to manage biohacking’s public health risks 
are largely already available. But they must 
be used better. 

As with other issues pertaining to public 
health, this also means that the future of 
regulating biohacking lies not only in more 
stringent policing, but in better education 
of its participants and a realistic under- 
standing that violations will be inevitable. 
Education would help private actors to un- 
derstand the risks posed by certain forms of 
biohacking and to appreciate FDA’s role in 
both consumer protection and fostering of 
innovation (6). Likewise, public regulators 
such as FDA would benefit from engaging 
with stakeholders to better understand ge- 
netic biohacking activities, its participants’ 
perspectives, and biohacking communi- 
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ties’ potential for self-governance—much 
as it already does with the pharmaceutical 
industry. Even with limited enforcement 
resources, agencies such as FDA have an 
opportunity to advance public health by 
working with biohacking communities as 
their practices and norms are being devel- 
oped—and before potentially problematic 
norms of risk, secrecy, and mavericks be- 
come widespread. 

No government or private policy will ever 
achieve perfect compliance, even in tradi- 
tional scientific settings—as He’s experi- 
ments painfully demonstrate. There will 
always be “rogue actors” who may main- 
tain connections with their institutional 
communities even after being “caught” 
(1). Striving for perfect compliance comes 
with substantial burdens, including throt- 
tling the development of new technologies, 
expending scarce enforcement resources, 
and facing political backlash. Appreciat- 
ing this should allow policy-makers and 
regulators—both public and _ private—to 
understand that different genetic biohack- 
ing activities will pose different risks and 
should merit different approaches, and to 
tailor existing regulatory mechanisms to 
mitigate genetic biohacking’s risks while 
amplifying its potential. 
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The grand story of carbon 


A geologist offers a melodic meditation on one of Earth’s 


most abundant elements 


By Nicola Pohl 


Ithough organic chemistry is often 
described as the science of carbon, 
Robert Hazen’s latest book, Syzm- 
phony in C, makes clear that this vital 
element cannot be contained by such 
a disciplinary boundary. 

Despite its abundance and importance, 
the location and cycling of carbon on Earth 
are not yet well understood. Ever- 
increasing atmospheric concen- 
trations of its dioxide form lend 
urgency to a more accurate ac- 
counting of this element. How- 
ever, it is Hazen’s enthusiasm, 
the string of shareable facts 
presented, and the introduction 
of so many interesting scientists 
that make this book such a fasci- 


Symphony in C 


personal perspective to this topic as head 
of the Deep Carbon Observatory (DCO), 
which brings together hundreds of scientists 
around the world to understand how carbon 
moves in all its forms in and around our 
planet. In 2015, the DCO started the “Carbon 
Mineral Challenge” to find the almost 150 
carbon-containing minerals predicted to ex- 
ist on or near Earth’s surface. Some of these 
predicted minerals, including a sodium lead 
carbonate called abellaite and 
green middlebackite, have already 
been found. 

The extreme pressures and tem- 
peratures hundreds of miles below 
Earth’s surface form denser crys- 
talline minerals, but almost none 
of the ~400 currently known car- 
bon minerals are in high-pressure 
phases. Hazen has been fascinated 


. Carbon and the . ie 
nating read. Evolution of (Almost) by experimental efforts to mimic 
Trained as a geologist, Hazen Everything these extreme pressures since he 


also has a deep love of music, 
which manifests in the symphonic 
form that superficially organizes 
the book’s content into four alchemical 
movements: earth, air, fire, and water. In 
the first section, “Earth’—the most well- 
developed of the book—Hazen discusses 
carbon-based minerals, offering a wonder- 
ful account of how mineralogy has gradually 
turned from a purely observational science 
to one that can predict missing carbon- 
containing minerals. 

Hazen brings a distinct and intentionally 
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was a graduate student, and he 
details the methods being used to 
reach 80,000-fold or greater atmo- 
spheres of pressure. Such high pressures do 
not lead only to simple structures, we learn. 
Additional surprises await when Hazen ex- 
plores deeper toward Earth’s core. 

The second section, “Air,” starts with a 
very brief history of Earth before discussing 
carbon in the air—primarily carbon diox- 
ide—and its cycling through the atmosphere. 
Hazen finishes this section with a wonderful 
discussion of the biases that humans, includ- 
ing scientists, have and the challenges they 
bring to forming a more integrated picture 
of the world. “Carbon science is like that— 
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Karen Lloyd and Donato Giovannelli investigate 
primitive microbes in Costa Rica as part of their work 
with the Deep Carbon Observatory. 


a blending of concepts and principles from 
every branch of research,” he writes. “So, per- 
haps when we seek to understand the limits 
to knowledge—the nature of the unknow- 
able—our own human limitations need to be 
placed at the top of the list.” 

“Fire” is devoted to the many ways we 
manipulate carbon, using it to create plastic 
materials, for example, as well as other prod- 
ucts of modern organic chemistry. Here, Ha- 
zen presents an almost random assortment 
of historical anecdotes about carbon-based 
materials, from methane to neoprene to pa- 
per, giving the reader a sense of the diverse 
forms carbon can take. This section is the 
shortest, and readers hoping for more detail 
might turn instead to Mark Miodownik’s re- 
cent book, Liquid Rules (reviewed in Science 
on 8 February 2019) (1). 

The final movement is the most specula- 
tive as Hazen describes the many scenarios 
that may account for the origins of carbon- 
based life forms. He makes clear his own bi- 
ases—he believes that mineral surfaces were 
crucial to the formation of the small organic 
building blocks of life, for example—as he 
gallops through a history of this field. 

The final pages of the book end with a 
very personal look at what Hazen calls the 
“human carbon cycle’—exploring how we 
consume and expel carbon—and examines 
our responsibility to our carbon-based home. 
“Tf we are wise, if we can temper our wants 
with a renewed sense of awe and wonder, if 
we can learn to cherish our rhapsodically 
beautiful carbon-rich world as it so urgently 
deserves, then we may hope to leave an unri- 
valed, priceless legacy for our children, their 
children, and all the generations to come.” 
Given the incredibly wide range of the ear- 
lier chapters, this philosophical ending fits. 

Throughout Symphony in C, science is 
presented as a living and very human en- 
deavor. Hazen fills the book with scientists 
and collaborators from around the world 
and with his own research stories. An eight- 
page insert of photos, which depict not only 
mineral structures and timelines of Earth’s 
evolution but also the men and women 
working to extend our knowledge, rein- 
forces the human part of the grand story of 
carbon. The rapid pace of research in the 
areas presented could quickly warrant an- 
other overview, but for now, this very read- 
able account should inspire a much broader 
interest in carbon. & 
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Unsettling robots and the future of art 


An Al-driven artist’s exhibition hints at, but never fully explores, the ethics of algorithms 


By Chiara Ambrosio 


y train ride from London to Oxford 
is filled with anticipation. Having 
just watched the latest season of 
Black Mirror, I anticipate the same 
blending of wit, unintended con- 
sequences, and dystopian paradox 
from the new exhibition that has 
just opened at the Barn Gallery in 
St John’s College, Oxford. 
Unsecured Futures is the solo 
show of the first ultrarealistic hu- 
manoid robot artist, Ai-Da. A versa- 
tile artist, Ai-Da dabbles in drawing 
and painting as well as sculpture 
and performance. “She” is the result of a 
large multidisciplinary team that features 
artists, curators, computer scientists from 
the universities of Oxford and Leeds, and 
Cornwall-based company Engineered Arts. 


The reviewer is at the Department of Science and Technology 
Studies, University College London, London WCIE 6BT, UK. 
Email: c.ambrosio@ucl.ac.uk 
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Unsecured Futures 
Lucy Seal, curator 
The Barn Gallery, 
St John's College, 
University of Oxford 
Through 6 July 2019 


The press release feeds my sense of expec- 
tation, promising me that she is absolutely 
unique: “Ai-Da’s ability as a robot to draw 
and paint from sight has never been 
achieved before and makes Ai-Da an artist 
in her own right, as well as a world first.” 

As I enter the gallery space, the exhibi- 
tion synopsis advises that Ai-Da produces 
art “but she is also the art herself, 
highlighting tensions of trans- 
humanism [sic], biotechnological 
advances, and how we will come 
to redefine our views of creativity.” 
And there is more: Ai-Da aims to 
raise questions on a host of press- 
ing social issues, ranging from 
how “necessary ethical discussions” about 
technology are “painfully lagging behind” 
to addressing “the frailty of our environ- 
ment.” The text concludes with a nod to 
George Orwell, Aldous Huxley, and Alek- 
sandr Solzhenitsyn. 

In three corners of the gallery, large 
screens show close-ups of Ai-Da perform- 
ing an original piece, Poetry of Consolation, 
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based on the prison literature of Fyodor 
Dostoyevsky, Oscar Wilde, and Severinus Bo- 
ethius. The artwork label, a printed A4 sheet 
of paper blue-tacked to the wall and typed 
in 12-point font, cryptically explains that the 
original texts are used “as the input data for 
AI [artificial intelligence] algorithms based 
on Markov chains.” It then swiftly moves to 
draw comparisons between prison litera- 
ture and “the pain and suffering of animals 
housed in unsuitable captive environments.” 

Along one of the gallery walls, Ai-Da’s 
pencil portraits of Charles Babbage, Alan 
Turing, Ada Lovelace (the robot’s namesake), 
Karel Capec, and Leonardo da Vinci and the 
robot’s own self-portrait are hung side by 
side—inscribing Ai-Da in the long history of 
creativity, computing, and invention. 

Ai-Da’s paintings are the most visually 
spectacular items on display. Rich tones of 
red, yellow, and green and thick brushstrokes 
suggest a human touch. The position of the 
lights draws the viewer’s gaze directly to the 
center of each work, heightening the “shat- 
tered” composition of the abstract paintings. 
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Artificial neural networks create the foundation for Ai-Da’s abstract paintings, plotting the coordinates onto a 
Cartesian plane. Artist Suzie Emery applies oil paint to canvas renderings to create the final works. Sculptures 
based on Ai-Da’s virtual renderings, including one based on a CT scan of a real bee, also appear in the exhibition. 


Labels state that coordinates from Ai-Da’s 
drawings are “plotted onto the Cartesian 
plane and then run through AI algorithms.” 
The digital versions of the artworks were 
transferred on canvas and overlaid with oil 
paint by artist Suzie Emery. 

On the train home, I cannot stop thinking 
about the paintings. How are neural net- 
works “trained” to obtain them? How does 
Ai-Da choose the color of her artworks? 

My questions are interrupted by an 
email from the exhibition’s press office, 
which kindly puts me in contact with the 
Oxford-based computer scientist behind 
the paintings, Aidan Gomez. In a clear 
and detailed email, he explains how he 
uses logs of Ai-Da’s motor instructions to 
wire the large neural network that cre- 
ates the foundational design for the paint- 
ings. He tells me that he chooses the color 
palette, clarifying, however, that “it is the 
network’s responses that dictate which 
colour lands at each point on the canvas, 
the lightness, the intensity, and so forth.” 
He shares digital proofs, which provide an 
exciting glimpse at what the images look 
like before they are transferred on canvas 
and overlaid with paint. He stresses that 
Emery’s artistic input is the decisive factor 
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in the final form of the paintings. 

Gomez’s clear answers remain over- 
shadowed—in the exhibition as well as in 
the press—by the hype around a female- 
presenting robot that produces original 
art. On my return home, I turn on the 
television and there is Ai-Da, with her 
flamboyant creator, Aidan Meller. As she 
quietly draws, Meller makes bold state- 
ments about ethics in the digital age and 
how we need to rethink creativity. A wel- 
come reference to philosopher Margaret 
Boden creeps in but remains unexplained. 

We need public engagement, he contin- 
ues. But thus far, the appearance of Ai-Da 
has produced exactly the opposite effect: 
a cacophony of fashionable terms that will 
alienate a nonacademic audience and in- 
furiate an academic one. We are already 
asking big questions about technology and 
about algorithms in particular. Virginia 
Eubanks’s Automating Inequality, Safiya 
Umoja Noble’s Algorithms of Oppression, 
and Cathy O’Neil’s Weapons of Math De- 
struction, albeit not concerned with art in 
the digital age, are only a few examples of 
how technology does not only need to be 
questioned—it needs to be questioned rig- 
orously and unpretentiously. 
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That mysterious algorithms and a ran- 
dom list of social issues live separate lives 
in the dazzling space of Unsecured Fu- 
tures is a worrying sign of what can hap- 
pen when a narrow conception of art joins 
the hype around AI and its applications. 


“Tensions of transhumanism,’ “ethics,” 
and “public engagement” become empty 
terms, waved for effect and never probed. 
The result is an overload of information 
that polarizes opinion, fosters divides, and 
obliterates the critical potential of art as 
research in its own right. 

As a philosopher, part of my job is to ar- 
ticulate questions that will take us beyond 
shallow slogans and commonplace assump- 
tions. I found depth in Suzie Emery’s brush- 
strokes, which compelled me to ask how 
Ai-Da’s paintings came to be. I found depth 
in Aidan Gomez’s honest answers about the 
uncertainties surrounding color processing 
in a neural network. I am certain that other 
collaborators, overshadowed by the lights of 
press conferences and television interviews, 
have equally interesting stories to tell. 

Giving more space to the “how” of art 
and science is difficult and time consum- 
ing. It requires investigating the details. 
But only when this component is brought 
to the fore can we begin articulating in- 
teresting questions about our relationship 
with technology. & 

10.1126/science.aay1956 
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MARINE ECOLOGY 


The biggest bloom 


Edited by Stella Hurtley 


Rubin et al., p. 43 


At 


loating mats of Sargassum seaweed in the center of the North Atlantic were first reported 

by Christopher Columbus in the 15th century. These mats, although abundant, have until 

recently been limited and discontinuous. However, Wang et al. report that, since 2011, 

the mats have increased in density and aerial extent to generate a 8850-kilometer-long 

belt that extends from West Africa to the Caribbean Sea and Gulf of Mexico (see the 
Perspective by Gower and King). This represents the world’s largest macroalgal bloom. Such 
recurrent blooms may become the new normal. —HJS 


Science, this issue p. 83; see also p. 27 


Underwater photograph taken near Key Largo, Florida, of a Sargassum mat from below 


Beyond adaptive 
immunity 

Prokaryotic CRISPR-Cas sys- 
tems defend bacterial cells from 
phage and plasmid infection. 
Strecker et al. characterized 

a CRISPR-Cas system that 
functions beyond adaptive 
immunity (see the Perspective 
by Hou and Zhang). Type V-K 
CRISPR-Cas from cyanobacteria 
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was associated with a Tn7-like 
transposon and a natural nucle- 
ase-—deficient effector Cas12k. 
Cas12k directed the insertion 
of Tn7-like transposons into 
target sites via RNA-guided Tn7 
transposition. This system was 
reprogrammed to efficiently and 
specifically insert DNA both in 
vitro and into the Escherichia coli 
genome. —SYM 

Science, this issue p. 48; 

see also p.25 


The potential for global 
forest cover 


The restoration of forested land 
at a global scale could help cap- 
ture atmospheric carbon and 
mitigate climate change. Bastin 
et al. used direct measurements 
of forest cover to generate a 
model of forest restoration 
potential across the globe (see 
the Perspective by Chazdon 
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and Brancalion). Their spatially 
explicit maps show how much 
additional tree cover could exist 
outside of existing forests and 
agricultural and urban land. 
Ecosystems could support an 
additional 0.9 billion hectares 
of continuous forest. This would 
represent a greater than 25% 
increase in forested area, includ- 
ing more than 500 billion trees 
and more than 200 gigatonnes 
of additional carbon at maturity. 
Such a change has the potential 
to cut the atmospheric carbon 
pool by about 25%. —AMS 
Science, this issue p. 76; 
see also p. 24 


Refilling the proton pump 
Proteins are dynamic. Rear- 
rangements of side chains, 
secondary structure, and 

entire domains gate functional 
transitions on time scales 
ranging from picoseconds 

to milliseconds. Weinert et 

al. used time-resolved serial 
crystallography to study large 
conformational changes in the 
proton pump bacteriorhodopsin 
that allow for redistribution of 
protons during the pumping 
cycle. They adapted methods 
used for x-ray free electron 
lasers to synchrotron x-ray 
sources. Large loop movements 
and a chain of water molecules 
were central to regenerating the 
starting state of bacteriorho- 
dopsin. —MAF 


Science, this issue p. 61 


Smaller but more ductile 
Poor ductility is one limiting fac- 
tor in widespread use of strong 
but lightweight magnesium 


sciencemag.org SCIENCE 
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alloys in cars, trains, and 
planes. The usual way to try to 
circumvent this poor ductility 
is by adding other elements, 
which can be costly. Liu et al. 
show that very small samples 
of pure magnesium are much 
more ductile than previously 
believed (see the Perspective 
by Proust). The small samples 
suppress the deformation 
twinning that causes fractures 
in larger samples. Avoiding 
this mechanism should allow 
development of high-ductility 
magnesium and other metal 
alloys. —BG 

Science, this issue p. 73; 

see also p. 30 


BEHAVIORAL ECONOMICS 
Honesty and selfishness 
across cultures 


Rationalist approaches to 
economics assume that people 
value their own interests over the 
interests of strangers. Cohn et al. 
wanted to examine the trade-off 
between material self-interest 
and more altruistic behaviors 
(see the Perspective by Shalvi). 
They distributed more than 
17,000 wallets containing various 
sums of money in 355 cities 
across 40 countries. In contrast 
to what rationalist theories of 
economics predict, citizens were 
more likely to return wallets that 
contained more money. The 
findings also reveal a high level 
of civic honesty across nations. 
—TSR 

Science, this issue p. 70; 

see also p.29 


CANCER 


Putting CART cells in idle 
Chimeric antigen receptor T 
(CAR T) cells can be an effec- 
tive cell therapy for cancer. 
Unfortunately, excessive 
activation of CAR T cells can 
occasionally cause severe, 
even lethal, toxicity. Existing 
approaches can suppress 
overactive CAR T cells, but 
these generally kill the CAR 

T cells, which abrogates both 
their toxicity and their anti- 
tumor effects. In contrast, 
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Mestermann et al. identified 
dasatinib as a drug that can 
temporarily inactivate CAR 

T cells. This helps reduce 
acute toxicity and allows the 

T cells to recover their anti- 
tumor effects after the drug is 
removed. —YN 

Sci. Transl. Med. 11, eaau5907 (2019). 


GUT MICROBIOME 
Inheriting microbiome 
variation 


n mice, the bacterial species 
in the gut microbiome are 
maternally inherited. Yardeni 
et al. found reduced gut 
microbiome species diversity 
in mice with variations in 
mitochondrial DNA (mtDNA) 
that were associated with 
increased reactive oxygen spe- 
cies (ROS) production. When 
pups were cross-fostered, 

the gut microbiome species 
after weening reflected not 
the microbiota community of 
the nursing mother but rather 
the inherited mtDNA variation. 
Pharmacological or genetic 
reductions in mitochondrial 
ROS levels increased microbi- 
ome diversity. —ERW 

Sci. Signal. 12, eaaw3159 (2019). 


CHEMICAL PHYSICS 
A panoramic view of 


photodissociation 


As light pulses get shorter in 
time, they correspondingly 
get broader in frequency. 
Kobayashi et al. take advan- 
tage of both properties of 
attosecond pulses to eluci- 
date iodine monobromide 
(IBr) photodissociation by 
detecting ultrafast bromine 
and iodine spectral shifts 
simultaneously. A prelimi- 
nary burst of light weakens 
the |-Br bond. Then, as the 
atoms fly apart, they reach a 
configuration where the bond 
vibration can couple ground 
and excited electronic states. 
The broadband probe pulse 
reveals rapid changes in each 
atom's electronic structure at 
this juncture. —JSY 
Science, this issue p. 79 


IN OTHER JOURNALS 


Ascanning electron 
micrograph of 
Escherichia coli 
bacteria, a workhorse 
for molecular and 
synthetic biology 


METABOLIC MODELING 


Edited by Caroline Ash 
and Jesse Smith 


Predicting resource use in microbes 


bacterial cell is a microscopic, self-replicating fac- 
tory that hosts thousands of interconnected chemical 
reactions. Metabolism varies with input materials and 
environment and can be manipulated in the lab to yield 
desired products. Bulovié et al. created a package of 
tools to automate construction of models of metabolite flux 
in bacteria. A key step is optimization of parameters using 
experimental data, but the predictions were robust even 
when limited information was available. The authors model 
a strain of Escherichia coli engineered to fix carbon, demon- 
strating that such models can be useful planning tools for 


synthetic biology. —MAF 
Metab. Eng. 55, 12 (2019). 


CANCER 
Microbial stages of 
colorectal cancer 


Colorectal cancers develop 
ina multistage process from 
polyploid adenomas to car- 
cinomas and more advanced 
disease stages. Increasingly, 


some bacterial species in the 
gastrointestinal microbiota have 
been linked to colorectal cancer 
development. Yachida et al. 
mapped the fecal microbiome 
contents of 616 individuals 
undergoing colonoscopy. They 
found significant increases of 
specific bacterial species with 
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COASTAL PROTECTION 
Storm protection services 


he tropics are at increasing risk of storm damage as the 
climate warms—a major worry, as coasts also concentrate 
human populations. Many coastlines in the tropics are, or 
once were, bordered by mangrove woodland. Mangroves 
colonize and stabilize the intertidal zone, not only providing 
habitats for terrestrial and marine organisms but also buffer- 
ing storm surges, winds, and saline intrusion. However, these 
forests are at risk owing to land reclamation for building, agri- 


against tropical cyclone damage. From 23 mangrove-fringed 
countries, they observed human activity by satellite mapping 

of nighttime light. Regions with narrow (6-meter) strips of 
mangrove suffer greater permanent economic loss over the 
next 6 years than do communities protected by 25 meters of 
mangroves or more, despite disaster-mitigation responses. This 
work underscores the importance of restoring natural infra- 
structure for the benefit of vulnerable communities, as well as 


culture, and other economic activities. Hochard et al. calculated _ biodiversity. —CA 
the global value of mangroves in protecting an activities 


different stages of colorectal 
cancer, indicating that these 
might be biomarkers of disease 
that can be used in diagnosis. 
—GKA 

Nat. Med. 25, 968 (2019). 


SIGNAL TRANSDUCTION 
Receptor get-together for 
hematopoiesis 


Wnt signaling is involved in 
many organismal processes. 
What is poorly understood in 
each pathway is which Wnt 
ligand requires which recep- 
tor. Blood-cell proliferation and 
specialization (hematopoiesis) 
requires specific Wnt and 
Frizzled ligand and receptor 
pairs. Grainger et al. discovered 
arole for a distinct receptor 
type in Wnt signaling. Labeling 
Frizzled9b protein in human 
cells with a peroxidase enzyme 
allowed closely associated 
proteins to be isolated and 
identified by mass spectros- 
copy. The most enriched protein 
was the epidermal growth 
factor receptor. Experiments 


inhibiting epidermal growth 
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factor receptor function also 
supported its role in regulation 
of Wnt signaling, perhaps by 
modulating receptor internaliza- 
tion. —LBR 

Nat. Cell Biol. 21,721 (2019). 


DEVELOPMENTAL BIOLOGY 


Coordinating organs 
Although individuals vary in 
size and shape, body parts are 
largely proportional. But what 
mechanisms monitor relative 
size between organs? In the 
larva of the fruit fly, Drosophila, 
adult body parts arise from 
structures called imaginal discs. 
When growth of one Drosophila 
imaginal disc is inhibited, 
similar size restriction is seen 
in other imaginal discs. Boulan 
et al. show that upon injury, 
the stress response transcrip- 
tion factor Xrp1 activates 
expression of a hormone-like 
peptide called Dilp8. Dilp8 is 
key for coordination between 
organs. Furthermore, the small 
ribosomal subunit protein 
Rps12 senses tissue growth and 
regulates Xrpl. Together, the 
Rpsl2-Xrp1-Dilp8 regulatory 


Proc. Natl. Acad. Sci. U.S.A. 116, 12232 (2019). 


a 


program maintains appro- 
priate body proportions by 
delaying growth to coordinate 
interorgan size. —BAP 


Dev. Cell 49, 811 (2019). 


APPLIED PHYSICS 
Flipping the bits 
with current 


Using magnetic fields to write 
information into memory 
storage limits the physical bit 
density of magnetic media. 
Hong et al. present a proof- 
of-principle demonstration of 
using spin-polarized currents 
instead of magnetic fields for 
the writing process, which 
could potentially increase 

this density. The magnetic 

bit was a multilayer structure 
with cobalt-iron-boron as the 
magnetic layer. A scanning 
tip—through which the spin- 
polarized current tunneled into 
the bit—was coated with a sim- 
ilar multilayer. The researchers 
were able to control the mag- 
netization of the bit with the 
current by placing the tip only 
3 angstroms away—a distance 
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Coastal mangrove forests 
near Krabi, Thailand 


that will need to be increased 
for commercial applications. 
—JS 

Appl. Phys. Lett. 114, 243101 (2019). 


CLIMATE CHANGE 
High-latitude stakes 


Sea ice in the Arctic is disap- 
pearing rapidly because of 
climate warming, a process 
which then accelerates warm- 
ing owing to the absorption 

of solar radiation by a sea 

that is no longer covered by 
highly reflective ice. How much 
additional heating would com- 
plete loss of that sea ice cover 
cause? Pistone et al. use satel- 
ite observations to estimate 
how much solar energy would 
be added to the climate system 
were all the sea ice in the 
Arctic to melt. They calculate 
that heating would increase 

by an amount equivalent to 
that which would result from 1 
trillion tons of carbon dioxide 
emissions, approximately 25 
years of current fossil fuel use. 
—HJS 


Geophys. Res. Lett. 10.1029/ 
2019GL082914 (2019). 
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METASURFACES 
Ametasurface 
polarization camera 


Imaging the polarization of 
light scattered from an object 
provides an additional degree 
of freedom for gaining informa- 
tion from a scene. Conventional 
polarimeters can be bulky and 
usually consist of mechanically 
moving parts (with a polarizer 
and analyzer setup rotating to 
reveal the degree of polariza- 
tion). Rubin et al. designed a 
metasurface-based full-Stokes 
compact polarization camera 
without conventional polariza- 
tion optics and without moving 
parts. The results provide a 
simplified route for polarization 
imaging. —ISO 


Science, this issue p. 43 


AFRICAN GENETICS 
East African genetics and 
pastoralism 


The origin and spread of 
domestic animals across the 
globe also affected the under- 
lying genetic composition of 
human populations. In Africa, 
however, it has been difficult to 
identify the impact of interac- 
tions among migrating food 
producers and local hunter- 
gatherers. Prendergast et al. 
wanted to discern the timing 
and movement of husbandry 
and pastoralism and its effects 
on foraging communities in 
Africa. They sequenced 41 
ancient eastern African human 
genomes from individuals that 
lived approximately 100 to 4000 
years ago. Surprisingly, relatively 
little genetic mixture occurred at 
the same time as the spread of 
pastoralism. —LMZ 
Science, this issue p. 44 


PROTEIN STABILITY 
Glycine N-degron 
regulation revealed 


For more than 30 years, 
N-terminal sequences have 
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been known to influence protein 
stability, but additional features 
of these N-end rule, or N-degron, 
pathways continue to be uncov- 
ered. Timms et al. used a global 
protein stability (GPS) technol- 
ogy to take a broader look at 
these pathways in human cells. 
Unexpectedly, glycine exposed 
at the N terminus could act as a 
potent degron; proteins bearing 
N-terminal glycine were targeted 
for proteasomal degradation 

by two Cullin-RING E3 ubiquitin 
ligases through the substrate 
adaptors ZYG11B and ZERI1. This 
pathway may be important, for 
example, to degrade proteins 
that fail to localize properly 

to cellular membranes and to 
destroy protein fragments gen- 
erated during cell death. —SMH 


Science, this issue p. 45 


NEURODEVELOPMENT 
Temporal code underlies 
circuit formation 


Olfactory neurons respond to 
various odorants according 
to which olfactory receptors, 
of many, they express. During 
development, axons from 
olfactory neurons that express 
the same olfactory receptor 
converge to share the same 
glomeruli. Nakashima et al. now 
show that, in mice, the neurons 
build these connections accord- 
ing to shared patterns of activity. 
When the olfactory receptor 
is triggered, it causes its cell 
not simply to fire but to fire in 
specific patterns. Neurons that 
speak the same code end up 
connected at the same glomeru- 
lus. —PJH 

Science, this issue p. 46 


CELL BIOLOGY 
Linking protein misfolding 
and innate immunity 


Multiple innate immune sensors 
undergo rapid assembly into 
large complexes known as sig- 
nalosomes. This is an essential 
step during cellular responses 
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to microbes and danger signals. 
How this process is regulated to 
avoid accumulation of poten- 
tially toxic protein aggregates 
remains poorly understood. 
Abdel-Nour et al. identified a 
pathway, dependent on heme- 
regulated inhibitor, eukaryotic 
initiation factor 2a, activating 
transcription factor 4, and heat 
shock protein B8, which controls 
the folding and scaffolding of 
innate immune sensors, allow- 
ing optimal proinflammatory 
signaling (see the Perspective by 
Pierre). The pathway appears to 
mirror the endoplasmic reticu- 
lum unfolded protein response 
(UPR), and so was named the 
cytosolic UPR (cUPR). The 
cUPR may represent a general 
mechanism to control protein 
misfolding in cells. —SMH 
Science, this issue p. 47; 
see also p. 28 


CELL BIOLOGY 
ER-phagy keeps cells 
healthy 


In eukaryotic cells, about one- 
third of all proteins are targeted 
to the endoplasmic reticulum 
(ER), which serves as a hub for 
secretory protein traffic and 
quality control. Cui et al. studied 
a protein known as Lst1 in yeast 
and SEC24C in mammalian cells 
that is involved in loading secre- 
tory cargo into vesicles that are 
delivered to the Golgi complex. 
In response to stress caused 
by starvation or misfolded 
aggregate-prone secretory pro- 
teins, Lstl acted to promote an 
additional function—ER-phagy. 
Together with autophagy recep- 
tors on the ER, Lstl targeted ER 
domains for degradation to avert 
protein aggregation, thus pre- 
serving cellular health. —SMH 
Science, this issue p. 53 
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BIOCHEMISTRY 
Oxygen sensing across 
kingdoms 


The ability to sense and respond 
to changes in oxygen levels is 
critical for most forms of life. 
To date, mechanistic studies of 
this process in mammals have 
focused on the oxygen-sensitive 
stability of a transcription fac- 
tor called hypoxia-inducible 
factor. Masson et al. discovered 
an enzymatic oxygen sensor 
in humans that is function- 
ally identical to plant cysteine 
oxidases, enzymes that control 
responses to hypoxia in plants. 
The human and plant enzymes 
convert the N-terminal cysteine 
in substrate proteins to cysteine 
sulfinic acid, a modification that 
ultimately targets the proteins 
for degradation. Oxygen sensing 
is impaired in many human dis- 
eases, and further study of the 
human enzyme could help in the 
development of strategies for 
therapeutic intervention. —PAK 
Science, this issue p.65 


NEUROSCIENCE 
Brain immune cells in 
disease 


The microglia are brain-resident 
macrophages that are impor- 
tant for homeostasis. Recent 
studies show that microglia are 
also involved in various brain 
diseases, such as multiple scle- 
rosis and Alzheimer’s disease. In 
a Perspective, Priller and Prinz 
discuss how these disease- 
associated microglia exhibit 
alterations that are specific to 
the type of disease. This specific- 
ity suggests avenues by which 
microglia might be targeted 

to overcome brain pathology. 
—GKA 


Science, this issue p. 32 
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IMMUNOLOGY 
Hypertension-induced 
alarm signal 


Human hypertension is a highly 
prevalent disease known to 

be associated with chronic 
low-grade inflammation. Zhao 
et al. used mouse models to 
look for hypertension-induced 
proinflammatory molecules that 
contribute to T cell activation 
and inflammation. They found 
consistent elevations in plasma 
levels of the alarmin molecule 
adenosine triphosphate in 
hypertensive mice. Increased 
adenosine triphosphate (ATP) 
concentrations promoted T cell 
responses by enhancing expres- 
sion of the CD86 costimulatory 
molecule on antigen-presenting 
cells, an effect mediated through 
the P2X7 purinergic receptor. 
Elevations of plasma ATP were 
also detected in a cohort of 
hypertensive human patients 
when compared with normoten- 
sive controls. Thus, ATP release 
and the ATP-P2X7 signaling axis 
represent potential targets to 
help rein in the proinflamma- 
tory sequelae associated with 
chronic hypertension. —|W 
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Matrix Fourier optics enables 
a compact full-Stokes 
polarization camera 


Noah A. Rubin, Gabriele D’Aversa, Paul Chevalier, Zhujun Shi, 


Wei Ting Chen, Federico Capasso* 


INTRODUCTION: Polarization describes the 
path along which light’s electric field vector os- 
cillates. An essential quality of electromagnetic 
radiation, polarization is often omitted in its 
mathematical treatment. Nevertheless, polari- 
zation and its measurement are of interest in 
almost every area of science, as well as in imag- 
ing technology. Traditional cameras are sensitive 
to intensity alone, but in a variety of contexts, 
knowledge of polarization can reveal features 
that are otherwise invisible. Determination of the 
full-Stokes vector—the most complete descrip- 
tion of light’s polarization—necessitates at least 
four individual measurements. This results in 
optical systems that are often bulky, reliant on 
moving parts, and limited in time resolution. 


RATIONALE: We introduce a formalism— 
matrix Fourier optics—for treating polarization 


her, 


in paraxial diffractive optics. This formalism is 
a powerful generalization of a large body of 
past work on optical elements in which polari- 
zation may vary spatially. Moreover, it suggests 
a path to realizing many polarization devices 
in parallel using a single optical element. We can 
then design diffraction gratings whose orders 
behave as polarizers for an arbitrarily selected 
set of polarization states, a new class of optical 
element. The intensity of light on a set of diffrac- 
tion orders is then dictated by the polarization 
of the illuminating light, making these gratings 
immediately applicable to full-Stokes polariza- 
tion imaging. 


RESULTS: We theoretically investigate these 
gratings and develop an optimization scheme 
for their design. Our diffraction gratings were 
realized with dielectric metasurfaces in which 


Metasurface-based polarization camera. (A) Photographic scenes contain polarized light 
that is invisible to traditional, intensity-based imaging, which may reveal hidden features. Our 
camera uses a metasurface (inset) that directs incident light depending on its polarization, 
forming four copies of an image that permit polarization reconstruction. (B to E) A plastic ruler 
and spoon are photographed with the camera. (B) A monochrome intensity image (given by 
the So component of the Stokes vector) does not reveal the rich polarization information 
stemming from stress-birefringence readily evident in (C) to (E), which show a raw exposure, 
azimuth of the polarization ellipse, and the S3 component of the Stokes vector that describes 


circular polarization content, respectively. 
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subwavelength, anisotropic structures provide 
for tunable polarization control at visible fre- 
quencies. Characterization of the fabricated 
gratings shows that they perform as designed. 
Notably, an arbitrary set of polarizations may 
be analyzed by a single unit cell, in contrast to 
past approaches that relied on interlacing of 
several individually designed diffraction grat- 
ings, increasing the flexibility of these devices. 
These gratings enable a snapshot, full-Stokes 
polarization camera—a 
camera acquiring images 
Read the full article in which the full polar- 
at http://dx.doi. ization state is known at 
org/10.1126/ each pixel—with no tradi- 
science.aax1839 tional polarization optics 
padista sna etaeacict cana id Ho maewinepanes (see 
panel A of the figure). Polarized light from a 
photographic scene is incident on the grating 
inside of a camera. The polarization is “sorted” 
by the specially designed subwavelength meta- 
surface grating. When combined with imaging 
optics (a lens) and a sensor, four copies of the 
image corresponding to four diffraction orders 
are formed on the imaging sensor. These copies 
have each, effectively, passed through a differ- 
ent polarizer whose functions are embedded 
in the metasurface. The four images can be 
analyzed pixel-wise to reconstruct the four- 
element Stokes vector across the scene. Several 
examples are shown at 532 nm, both indoors 
and outdoors. The figure depicts an example 
photograph of two injection-molded plastic 
pieces, aruler and a spoon (illuminated by a 
linearly polarized backlight), that show in- 
built stresses (see panels C to E of the figure) 
that are not evident in a traditional photo- 
graph (panel B). The camera is compact, re- 
quiring only the grating (which is flat and 
monolithically integrated, handling all the 
polarization analysis in the system), a lens, 
and a conventional CMOS (complementary 
metal-oxide-semiconductor) sensor. 


CONCLUSION: Metasurfaces can therefore 
simplify and compactify the footprint of optical 
systems relying on polarization optics. Our de- 
sign formalism suggests future research direc- 
tions in polarization optics. Moreover, it enables 
a snapshot, full-Stokes polarization imaging sys- 
tem with no moving parts, no bulk polarization 
optics, and no specially patterned camera pixels 
that is not altogether more complicated than a 
conventional imaging system. Our hardware 
may enable the adoption of polarization imag- 
ing in applications (remote sensing, atmospheric 
science, machine vision, and even onboard au- 
tonomous vehicles) where its complexity might 
otherwise prove prohibitive. 


The list of author affiliations is available in the full article online. 
*Corresponding author. Email: capasso@seas.harvard.edu 
Cite this article as N. A. Rubin et al., Science 365, eaax1839 
(2019). DOI: 10.1126/science.aax1839 
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Matrix Fourier optics enables 
a compact full-Stokes 
polarization camera 


Noah A. Rubin’, Gabriele D’Aversa’”, Paul Chevalier’, Zhujun Shi’, 


Wei Ting Chen’, Federico Capasso’* 


Recent developments have enabled the practical realization of optical elements in 
which the polarization of light may vary spatially. We present an extension of Fourier 
optics—matrix Fourier optics—for understanding these devices and apply it to the design 
and realization of metasurface gratings implementing arbitrary, parallel polarization 
analysis. We show how these gratings enable a compact, full-Stokes polarization camera 
without standard polarization optics. Our single-shot polarization camera requires no 
moving parts, specially patterned pixels, or conventional polarization optics and may 
enable the widespread adoption of polarization imaging in machine vision, remote 


sensing, and other areas. 


olarization refers to the path traversed by 

light’s electric field vector. As a fundamen- 

tal characteristic of light, polarization and 

its measurement are of great interest in 

almost all areas of science and in imaging 
technology as well. Traditionally, polarization is 
spoken of as a property of a beam of light. How- 
ever, advances in the last few decades in holo- 
graphic media, micro- and nanofabrication, and 
other areas have enabled the practical realiza- 
tion of optical elements with tailored, spatially 
varying polarization properties, even on a sub- 
wavelength scale, at optical frequencies. In these 
devices, the polarization state of light can be 
varied controllably, point-to-point across an op- 
tical element. 

Work of this nature now has an extensive lit- 
erature across several disciplines of optics under 
various names, including diffractive optics (J, 2), 
polarization holography (3-6), and nanophoton- 
ics (metasurfaces) (7, 8), in addition to apprecia- 
ble attention from the liquid crystal community 
(9, 10). These devices exhibit different behavior 
depending on the polarization of illuminating 
light, so a natural question that arises is how they 
can be designed to implement many polarization- 
dependent functions in parallel. 

We consider a generalization of work in this 
area, which we call matrix Fourier optics, that 
suggests new design strategies for the design of 
polarization optics not previously achievable in 
a single element. In particular, we apply it to the 
design of metasurface diffraction gratings that 
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can analyze several arbitrarily specified polariza- 
tion states in parallel. 


Matrix Fourier optics 


We present a general way of viewing diffrac- 
tion from a polarization-dependent obstacle. In 
the plane-wave expansion (or angular spectrum) 
picture of optics, an electromagnetic disturbance 
U(a,y) in a plane can be considered as being 
formed from the interference of many plane 
waves incident at different angles. An individual 
plane wave in this set is characterized by its in- 
plane wave-vector (Ky, k,) and a weight given by 
the Fourier transform of the optical field as 


A(ky, ky) = |)_U(w.y)et"** de dy (1) 


Each plane wave with its weight A(K, ky) can 
be individually propagated forward in space to 
form the field at a different position (Fig. 1A). 
This formalism can treat light’s interaction with 
simple obstacles. If a planar obstacle having 
a transmission function ¢(a,y) (e.g., Young’s 
double slit, or a diffraction grating) is illuminated 
by an incident field of magnitude Ey (which may 
also be spatially-varying), the field immediately 
following the obstacle, ¢(#, y)Eo, can be handled 
in this way. 

This intuitive understanding of optical wave 
propagation is the basis of the field broadly known 
as Fourier optics (77) and underpins much of mod- 
ern optical physics, including imaging and holog- 
raphy. Notably, however, this picture is a scalar 
one and does not include light’s polarization. 
Nonetheless, it is possible to conceive of optical 
elements in which polarization-dependent prop- 
erties vary as a function of space. Then, instead of 
the scalar transmission function ¢(z, y), we may 


consider that an optical element is associated 
with a matrix-valued function J (x, y). Here, J(2, y) 
denotes the local Jones matrix (72), the way in 
which the Jones polarization vector is modified, 
at each point. (In this work, matrix quantities are 
denoted by a tilde.) 

We consider that such an optical element 
with a Jones matrix profile J(a, y) is illuminated 
by a field described by the Jones vector ket 
|Eo) (in acknowledgment of its polarized nature). 
The field immediately after the obstacle is then 
given by J (a, y)|Eo). This field, too, can be prop- 
agated by plane-wave expansion as given by the 
Fourier transform 


a IF. y)|Eo et) dx dy (2) 


which is now vector-valued. If the incident wave 
is anormally incident, uniform plane-wave, it will 
carry no space-dependence and can be removed 
from the integral. The only integral to be eval- 
uated, then, is given by 


+o 
A(kesky) = |) F (eye?) ax dy (3) 


which is a Fourier integral distributed across 
each of the four elements of the 2 x 2 Jones 
matrix yielding a Fourier coefficient A (Kp, ky) 
which is itself a Jones matrix (Fig. 1B). 

This matrix Fourier transform is physically 
significant. Instead of an amplitude and phase, 
as in traditional Fourier optics, a given direc- 
tion (kz, ky) is associated with a polarization- 
dependent behavior given by the Jones matrix 
operatorA (Kx, ky). In a sense, this description 
decouples the optical element described by 
J (a, y) from the polarization of the illuminating 
light: The interaction of all possible incident 
polarizations with J (@, y) is handled at once by 
the matrix Fourier transform A (kp, Ky). 

In this work, we specialize to polarization- 
dependent diffraction gratings—that is, elements 
in which J (@, y) is a periodic function and the 
angular spectrum A(kr, k,) is discrete, yielding 
diffraction orders. Diffraction gratings simply 
reveal the consequences of this matrix Fourier 
optics, in particular, that it can be inverted and 
used as a design strategy for multifunctional 
polarization optics. Suppose there is a set of 
desired polarization devices to be realized (e.g., 
polarizers, waveplates, optically active elements, 
or any behavior that can be described by a Jones 
matrix) having the Jones matrices {J;,} and a set 
of diffraction orders {¢}. The Fourier series 
expansion 


J (a,y) = > Frelbortleyy) 


ke{0} 


(4) 


represents a single optical element realizing all 
of these (potentially complicated) polarization- 
dependent functions in parallel with each order 
implementing its own polarization device (Fig. 1C). 
In the case of a grating, the Fourier transform 


lofs 
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Eq. 3 becomes a Fourier series in Eq. 4 with the 
optical elements {J;,} as coefficients. We refer to 
such a diffraction grating as a matrix grating. 

This way of viewing polarization-dependent 
diffraction has not seen wide acknowledgment 
or use in the literature of polarization [with 
some limited exceptions in diffractive optics 
(13, 14) and polarization ray-tracing (15, 16)]. 
In acommon strategy, optical elements—especially 
in the field of metasurfaces—are often designed 
so that when a given polarization is incident, the 
output polarization state is uniform across the 
element and a scalar phase profile is imparted. 
When the orthogonal polarization is incident, 
the output polarization state is again uniform 
and a second phase profile is experienced. In this 
way, optical elements with different functions for 
chosen linear, circular, and arbitrary elliptical po- 
larization bases (7, 8, 17) can be realized. Notably, 
what is widely referred to as the “geometric” or 
“Pancharatnam-Berry” phase, at least as it relates 
to this problem, is a subcase of that approach, 
being used to create scalar phase profiles for 
circularly polarized light (2, 9, 18-21). In other 
past work, polarization is understood to vary 
with space, but a particular incident polariza- 
tion state is assumed (22, 23). These past design 
strategies are subcases of the matrix approach 
presented here. 


Parallel polarization analysis by unitary 
polarization gratings 


The formalism presented in the last section is 
general. Equation 4 provides a prescription for 
the realization of a diffractive element imple- 
menting arbitrary polarization behavior, but 
does not specify the nature of the desired opti- 
cal functions (contained in {Je}) or how the re- 
sulting J (@, y) should be practically realized. 

We focus on cases in which {J;,}, the behaviors 
implemented by the grating orders, are polariza- 
tion analyzers. This choice is not a fundamental 
one, but it is highly relevant for practical appli- 
cations: Analyzers are perhaps the most fun- 
damental polarization element and, moreover, 
devices that project light onto different states of 
polarization are of practical use in polarimetry, 
the measurement of light’s polarization state 
(24). A diffraction order k behaving as a polarizer 
can be described by a Jones matrix that is a dyadic 
(outer product) expressed as 


Je = Ox|Pr) (Gel (5) 


Equation 5 describes a Jones matrix analyzing 
for the chosen Jones vector |q,) in accordance 
with Malus’ law: When |q;,) is incident, inten- 
sity transmission is maximum. If instead the or- 
thogonal polarization |q;,) with (q;,|¢r) = O arrives, 
the output light is quenched. The light emerging 
from J, will carry the polarization state |p,). (A 
traditional polarizer familiar from laboratory ex- 
perience has |p;) = |qz,)). Finally, ay is a complex 
(scalar-valued) weight. 

Once the desired analyzers {J,} are specified 
(Eq. 5), the grating J (a, y) can be derived by Eq. 4. 
What physical optical element should implement 
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the resulting J (a, y)? A highly convenient real- 
ization medium takes the form of metasurfaces 
(25, 26)—subwavelength-spaced arrays of nano- 
photonic phase shifters—composed of dielectric 
pillars possessing form birefringence (7, 8). Locally, 
the Jones matrix of these metasurfaces may be 
well-approximated by a linearly birefringent wave- 
plate (8) given by 


J(@,y) = 
ty (ay) 
R(0(@.u)) (6 voyiew))R(—0(2,9)) (6) 

Stated differently, such a metasurface can re- 
alize a sampled matrix grating where J (@, y) is 
locally of the form of Eq. 6. The use of dielectric 
metasurfaces obeying Eq. 6 is, again, not a fun- 
damental choice, but an especially convenient 
one: 6,, 6,, and @ are all easily and continuously 
adjusted by varying the dimensions and angular 
orientation of a simple dielectric pillar, easily 
fabricated lithographically (Ris a 2 x 2 rotation 
matrix). We refer to metagratings structured 
from these elements simply as “gratings,” but in 
contrast to more generic diffraction gratings, 
it should be understood that the gratings here 
possess special polarization properties owing to 
their subwavelength features. 

By inspection, Eq. 6 describes a J ones matrix 
that is (i) unitary everywhere—thatis,J J = 1 
for all (@, y), with 1l the 2 x 2 identity matrix— 
and more specifically, (ii) linearly birefringent, 
having linear polarizations as its eigenvectors 
for all (a, y). 

From the completely general matrix picture 
presented we have made two specific choices that 
the diffraction orders of the grating should behave 
as analyzers and that the grating should locally re- 


Fig. 1. Matrix Fourier Optics. A 
(A) A scalar optical field U(x, y) 

has a plane-wave expansion 

A(kx, ky) obtained by Fourier 
transform (F) that permits its 
propagation forward in space. 

(B) An obstacle implementing 

a2 2 Jones matrix J x,y) that 
varies with space (and thus 
contains four scalar functions, 
which are schematically repre- 
sented as different planes 

here) has a Fourier-domain Cc 
representation A(kx, ky) that is 
also a Jones matrix. A(ky, ky) 
encodes the behavior of direc- 
tion (ky, ky) as a function of 
incident polarization. (C) If 


J(x, y) represents a periodic 
grating, its orders are 
described by Jones matrix 
coefficients Ji, 4,), each 
describing an independent 
polarization device, for 
instance, a configuration of 


semble Eq. 6, implying the above two constraints. 
But are these choices mathematically consistent 
with one another? That is, for a general set of dif- 
fraction orders {/} implementing the polariza- 
tion analyzers J;, = Qx\Pr)(Qe| for ke {2}, will 
the J (a, y) mandated by Eq. 4 obey the form of 
Eq. 6 for all (@, y)? 

This question is a matrix analog of extensive 
work conducted on phase-only gratings (27, 28). 
We show in the supplementary text (section S1) 
that the linear birefringence required by Eq. 6 im- 
plies that the J;, must be symmetric mandating 
that each take the form J;, = ax|G;,) (Qe|- That is, 
the output polarization of each analyzer must be 
the complex conjugate of the Jones vector being 
analyzed for; this is equivalent to a mirroring 
about the equatorial plane of the Poincaré sphere, 
a change in sign of the third (chiral) Stokes com- 
ponent. Physically, this means that the analyzer 
leaves the polarization ellipse’s shape unchanged 
while reversing the handedness of its rotation, a 
generalization of a key conclusion of (7) and (8). 

Accepting this constraint guarantees linear 
birefringence (that is, matrix symmetry) for all 
(a, y), but not necessarily unitarity everywhere. 
This unitarity restriction in particular implies 
that, if we insist on a matrix grating with exactly 
n orders that are polarization state analyzers, the 
form of Eq. 6 can only be matched everywhere 
if nm = 2 and the polarizations analyzed for are 
strictly orthogonal (as proven in supplementary 
text section S1) (17). 


Design strategy and optimization 


If we insist on using a single-layer metasurface 
obeying Eq. 6 everywhere, then we may not have 
all light confined to an arbitrary set of diffraction 
orders acting as polarization state analyzers. But 


Teas 


birefringent plates. The grating implements many such devices in parallel. 
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what if some light is allowed to leak into other 
diffraction orders? Then, the J;, of some orders 
could behave as analyzers for arbitrary polar- 
ization states with a limited amount of leakage 
into other diffraction orders (implementing their 
own J;) so that the overall grating formed in Eq. 4 
is still unitary everywhere. In other words, the 
grating can function as a coupled system in which 
neighboring diffraction orders compensate for 
desired polarization-dependent behavior on a 
selected few in a way that preserves overall uni- 
tarity for all (a, y). 

Here, we seek to design a grating J (a, y) that 
locally obeys Eq. 6 with a set of diffraction orders 
{¢} each having J;, that are analyzers for an 
arbitrarily specified set of polarization states 
{|qx)} as defined in Eq. 5 with as little light as 
possible leaking into diffraction orders out- 
side of {}. This is a question of optimization. We 
define the quantities I id = (dk [7 Tn lax) , the 
power on diffraction order k when the pre- 
ferred polarization |q;,) is incident, and Tf, = 
(% [7 is ke lat), the power on diffraction order k 
when the orthogonal polarization |g;) with 
(qld) = Vis incident. The sum > Ii should 

ke{l 

be maximized to keep as much light 2 possible 
in the orders of interest. Simultaneously, the 
contrast of each order given by yn = (r - Ik )/ 
(Ii + If, )—the polarization sensitivity of the order 
to the desired polarization |q;,)—should be as 
close to unity as possible to constrain the J;, to 
act as the desired analyzers (in the sense of Eq. 5). 
This can be addressed with gradient-descent. 
We reserve detailed discussion of this optimiza- 
tion to the supplementary text [sections S1 and 
$2, where we also describe a path to an analyt- 
ical form of the solution using variational methods 
(77, 27, 28)]. 

Many previous works have considered dif- 
fraction gratings capable of splitting, and thus 
analyzing, light on the basis of its polarization 
state. However, these works have generally taken 
a scalar approach, seeking to impart opposite 
blazed grating phase profiles on orthogonal po- 
larization states. Consequently, several individ- 
ually designed gratings must be interlaced (often 
called “spatial multiplexing” or “shared aperture”) 
(29-32) or, equivalently, cascaded in series (1, 33) 
to create a grating whose orders analyze for any 
more than two polarization states. This is in- 
herently problematic: These configurations can- 
not implement polarimetry with any less than six 
measurements (whereas four is the minimum 
required), compromising sensor space. Moreover, 
the polarization states of the analyzers cannot be 
arbitrarily specified. Interlacing different gratings 
also introduces unwanted periodicity, mandating 
loss of light to out-of-plane diffraction. The matrix 
approach of this work shows that interlacing is 
not necessary—all functions can be integrated 
into a single grating—and moreover, affords 
possibilities not achievable by interlacing. The 
tetrahedron grating that we will present, as a sim- 
ple example, is not possible by simple interlacing 
as none of its four analyzer states are orthogonal 
to any of the others. 
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Experiment 

Tetrahedron grating design 

Gratings implementing parallel polarization anal- 
ysis are of practical interest for Stokes polarime- 
try (24, 34) which requires a minimum of four 
projective polarization state measurements. For 
maximum fidelity of Stokes vector reconstruction, 
these four states should be as distinct from one 
another as possible, accomplished by choosing 
analyzer states corresponding to a tetrahedron 
inscribed in the Poincaré sphere (polarization 
state space) (35-37). We use the matrix formal- 


Fig. 2. Matrix gratings for A 


ism and optimization scheme described above 
to design a two-dimensional diffraction grating 
analyzing for these four polarization states on 
its innermost four orders. The leftmost panel 
of Fig. 2A gives a map of these orders and the 
polarization ellipses ({|g;,) }) they are designed 
to analyze for in k-space. These diffraction 
orders and desired polarization states can be fed 
into the aforementioned optimization, yielding a 
numerical J (@, y) that is locally of the form of 
Eq. 6 (unitary and linearly birefringent). It is 
then straightforward to map the locally required 


arbitrary parallel polarization O 


analysis. (A) A 2D grating 
unit cell is designed to analyze f 


four polarization states 
corresponding to a tetrahedron 
inscribed in the Poincaré 
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sphere. On the left is a map of 
the diffraction orders and B 
the polarization ellipses they 

analyze in k-space. The designed 
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: , Order of 
11 x 11 element grating unit interest ya 
cell containing TiO2 rectangular — \ - 
pillars implementing this at Munn) 
4 = 532 nm is shown in design Polarization 
(middle) and as-fabricated control Ful sth 
polarimeter 


[scanning electron micrograph 
(SEM), right]. Scale bar, 1 um. 
(B) The grating is illuminated by Cc 


light whose polarization is 1.00 | 
varied while recording the 3 
output polarization on a single a 0.95 + 
diffraction order with a full- 2 
Stokes polarimeter permitting 8 0.90 L 


reconstruction of the order's 
4 x 4 experimental Mueller 
matrix Minn). (C) The 
polarization contrast of each 
order is shown. Each order 

is labeled by the polarization 
ellipse it analyzes, and results 
from the analytical grating 
design (as-optimized, red), a 
full-wave simulation of the 
grating (blue), and experiment 
(green) are given. (D) The 
polarizations analyzed by each 
order [for which they have 

the contrasts given in (C)] of 
the tetrahedron grating are 
shown on the Poincaré sphere 
alongside a tetrahedron 
indicating the desired analyzer 


\ 


@|dmn) @|Ginn) @ Average output polarization 


polarizations as predicted by the optimization, a full-wave simulation, and as-measured. (E) A set of 
1024 uniformly sampled input polarization states on the Poincaré sphere can be operated on by 
an experimentally determined Mueller matrix Monn) to form a distorted set of states depicting output 
polarization as a function of input. Each sphere on the right represents the behavior of one of the 
four engineered grating orders, whose designed analyzer polarizations are shown. On each, a blue 
arrow denotes the measured analyzer polarization |qimn)) from (C), a red arrow the equator-mirrored 
polarization IQimny)» and a green arrow the average output polarization over all points which, according 
to the formalism here, should overlap with the red one. (In some cases, the overlap of red and green 


arrows is too close to permit visual discrimination.) 
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Jones matrix (o,, 6,, 8) to geometries of actual 
structures by referring to a library of such struc- 
tures in a material platform/wavelength regime 
of interest (8). 

In this work, that material platform is TiO, 
pillars fabricated with an e-beam lithography 
and atomic layer deposition technique exten- 
sively documented elsewhere (38). This permits 
operation at technologically important visible 
wavelengths, aiding the camera application dis- 
cussed below. We stress, however, that neither 
TiO, nor visible wavelengths are central to this 
work. The grating unit cells presented here have 
11 such elements to a side with an interelement 
separation of 420 nm so that the diffraction angle 
at’ = 532 nm is 89p~6.6° (paraxial). The subwave- 
length spacing of the pillar elements themselves 
assures that radiative orders may only stem from 
the collective unit cell of 11 x 11 elements. The 
grating unit cell designed by optimization for the 
tetrahedron case is shown in Fig. 2A (top and 
bottom, respectively), both in design and as- 
fabricated (electron micrograph, right). This 
unit cell is tessellated hundreds of times to create 
a grating. 


Fig. 3. Metagrating full- A 
Stokes polarization 


camera. (A) A matrix i asia 
i facet : photographic 
metagrating (as in Fig. 2) is eccne 


integrated with an aspheric 
lens to image four diffraction 
orders onto four quadrants 

of a CMOS imaging sensor. 

A ray trace of one diffraction 
order is shown. The camera is 
designed to image far-away 
objects, so each color 
corresponds to different 
parallel ray bundles incident 
on the grating from different 
angles over a +5° FOV. Not 
shown: A 10-nm bandpass 
filter at 532 nm and an aper- 
ture in front of the grating to 
imit the FOV to prevent 
overlap of the four subimages. 
B) Each copy of the image 
on each quadrant has been 
analyzed along a different 
polarization. Pixel-wise 
differences in intensity can be 
used to synthesize a single 
polarization image of the 
scene in which the full Stokes 
vector is known at each point. 
C) A clearer side view of 

the ray trace in (A). Blue, 
orange, and green correspond 


Mueller matrix polarimetry and results 
In polarization optics, the Jones and Mueller 
matrices describe polarization-dependent behav- 
ior. Of the two, only the Mueller matrix is directly 
observable, being a description of optical inten- 
sities rather than electric fields. To that end, we 
perform Mueller matrix polarimetry, that is, the 
experimental determination of the 4 x 4 Mueller 
matrix (m,n) Of each of the four grating orders 
(m,n) of interest (as a 2D grating, the orders are 
labeled by two integer indices). The grating is 
illuminated with laser light at 1 = 532 nm in 
several (at least four) different input polarization 
states with known Stokes vectors {Sin}. As the 
polarization switches between the different known 
inputs, a full-Stokes polarimeter can be placed on 
the order of interest to record the corresponding 
output Stokes vectors, {Sour}. The matrix M (man) 
linking {Sin} to {Sou} can then be numerically 
determined from the data in the least-squares 
sense. This is sketched in Fig. 2B and discussed in 
more detail in supplementary text section S3 (17). 
In analyzing the results, it is difficult to make a 
direct comparison of matrix quantities. Instead, 
because each order of interest is designed to act 


Imaging lens 
(f = 20 mm asphere) 


to ray bundles incident at +5° 0° and —5° respectively. (D) Optical microscope image of the grating 
sample, which is 1.5 mm in diameter and shows Fresnel zones owing to the weak, polarization- 
independent lensing effect imposed on top of the metagrating. An SEM inset shows the subwavelength, 
form-birefringent TiOz pillars comprising the metasurface. (E) The imaging system in (A) can be 
packaged into a practical, portable prototype with adjustable focus. The essential part of the camera, 
as shown in (C), is about 2 cm long. The prototype here is larger for ease of optomechanical mounting. 
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as a polarization analyzer, it makes sense to ask 
the following questions: Do the orders act as 
analyzers? For which polarizations? And, with 
what efficiency? 

The first question is addressed by Fig. 2C, 
which plots the polarization contrast of each 
diffraction order. Each group of bars in Fig. 2C 
corresponds to the diffraction order designed to 
analyze for the polarization ellipse shown below 
it. Each group contains three color-coded bars: 
one for the numerically optimized J (@, y), one 
for a full-wave simulation, and one for measure- 
ment. On the left of Fig. 2D, the contrast (the 
normalized difference between the intensity on 
that order when its preferred polarization is inci- 
dent versus the orthogonal one) is plotted. An 
ideal polarizer would have 100% contrast. The 
numerically optimized result predicts near per- 
fection (100%). In a simulation of the grating 
design, the contrast decreases somewhat, and 
then again as measured in actuality. However, 
the four orders of the measured grating all show 
polarization contrasts in excess of 90%—the de- 
signed orders of the grating do act as polarizers 
(analyzers). 

Second, it must be verified that the orders act 
as analyzers for the polarization states specified 
in the design. In Fig. 2D, the polarization states 
for which each grating order has maximum out- 
put intensity (as-optimized, in simulation, and 
as-measured) are plotted on the Poincaré sphere 
alongside the tetrahedron representing the goal 
of the design. It can be seen that these analyzer 
polarizations are close to their desired counter- 
parts (there are no notable mixups in Fig. 2D, 
whereby one analyzer polarization lands very far 
away appearing to be closer to another, falsely 
exaggerating correspondence). 

Figure 2, C and D, shows that the metasur- 
face grating implements four arbitrarily specified 
analyzers in parallel with no polarizers or wave- 
plates, lending validity to the matrix approach 
underlying its design. We address the question of 
efficiency in the supplementary text (section S3) 
(17). The grating’s diffraction efficiency cannot be 
quantified with a single number—it is polarization 
dependent. Averaged over all possible input polar- 
izations, efficiency in terms of power diffracted 
into the four orders of interest over incident power 
exceeds 50%, high enough to enable practical use. 

The data contained in Fig. 2, C and D, are de- 
rived from the first row of the Mueller matrix, 
which dictates the intensity Sp of the outgoing 
beam. The remainder of the Mueller matrix con- 
trols the output beam’s polarization state. This 
can be visualized by individually operating on a 
set of input Stokes vectors that uniformly sample 
the Poincaré sphere (1024 dots) with each order’s 
measured Mueller matrix (mn) and plotting 
the set of Stokes vectors that result as a new, 
distorted set of dots. This is shown for each of the 
four orders of interest in Fig. 2E on four spheres. 
If each behaved as a perfect analyzer (compare to 
Eq. 5), all polarizations would be mapped into a 
single point in the distorted diagram at the state 
corresponding to |p;,). The grating is not perfect 
in experimental reality, so the spheres in Fig. 2E 
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Fig. 4. Full-Stokes polarization imagery. Indoor (A to C) and outdoor 
(D and E) photography with the camera depicted in Fig. 3. In each 
case, the raw unprocessed exposure, So (the traditional monochrome 
intensity image), the azimuth of the polarization ellipse [in degrees, 
given by arctan(S2/S;)], and the degree of polarization (DOP, given by 


\/S? + S5 + S3/So) are shown. See text for detail on each case. Indoor 


contain points distributed over the entire sphere; 
the extent to which these points are concentrated 
in one direction and extremely sparse elsewhere 
illustrates that each diffraction order does in- 
deed act as an analyzer. For each grating order in 
Fig. 2E, a blue arrow corresponds to the polariza- 
tion being analyzed (copied from Fig. 2D) (q(m.n)) 
in the notation of Eq. 5). A green arrow shows 
the polarization on the grating order averaged 
over the set of incident polarizations. Finally, a 
red arrow depicts \Qnny) , which has flipped 
handedness with respect to |@(m,n)) and is thus 
mirrored about the equator of the sphere. Ac- 
cording to the picture presented above, the linear 
birefringence of the grating elements implies 
that the output must necessarily take on the 
form of \Qm.n))* This aspect of the theory is sup- 
ported by the close overlap of the red and green 
arrows in each plot of Fig. 2E. 

In the supplementary text, results are given 
for a second grating, an octahedron grating, in 
which six diffraction orders act as analyzers (17). 


Full-Stokes polarization imaging 


We now apply these matrix gratings in an area 
of great practical interest. If paired with four de- 
tectors, the tetrahedron grating presented above 
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could function as a single-component, compact, 
and integrated full-Stokes polarimeter (a sensor 
to measure the polarization state of a beam), an 
area where metagratings (29-32) and integrated 
approaches (39, 40) have recently attracted con- 
siderable interest (34). However, for a variety of 
applications (41-44), a polarization camera, or 
imaging polarimeter, is of even more utility. A 
polarization camera captures the Stokes vector 
at each point in an image. In the case of the four- 
element Stokes vector, this necessitates four 
independent image acquisitions along indepen- 
dent polarization directions, which may be taken 
sequentially in time (“division-of-time,” limiting 
temporal resolution and often requiring moving 
parts), by patterning a focal plane array with 
micropolarizers (“division-of-focal-plane,” requir- 
ing expensive fabrication, usually without of- 
fering full-Stokes vector determination, and 
mandating loss of photons to absorptive micro- 
polarizer elements), or by simultaneous capture 
of the image along four paths each with indepen- 
dent polarization optics (“division-of-amplitude,” 
substantially increasing system bulk and com- 
plexity) (47). 

Grating-based approaches are not new to po- 
larimetry (24, 45) (and imaging polarimetry), 
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images were acquired with exposures on the order of 100 ms, outdoor 
images with exposures on the order of 10 ms. In both cases, polarization 
imagery can be acquired at video framerate. The bright disk in the center 
of each raw exposure is zero-order light that does not interact with the 
grating and thus forms a defocused image of the scene. All images are at a 


single color (green, 4~ 532 nm). 


particularly in the liquid crystal (46) and, more 
recently, metasurface literatures (29-32). OWing 
to the limitations of previous approaches, how- 
ever, multiple gratings—either cascaded in series 
or patterned adjacent to one another—are re- 
quired to implement the measurements neces- 
sary for full-Stokes vector determination. The 
matrix gratings here are free from this complica- 
tion and full-Stokes imaging can be implemented 
with a single polarization element, promising 
wide-ranging applications in machine vision and 
remote sensing. 


Design of an imaging system 


The task here is to integrate a metagrating into a 
photographic imaging system. The tetrahedron 
grating described above is chosen because it 
offers full-Stokes determination with only four 
measurements, the minimum necessary. We 
developed an imaging system composed of this 
grating (implemented with a TiO, metasurface 
as above) followed by an aspheric lens (f= 20 mm, 
whose choice is discussed in supplementary text 
section S4) and a standard monochrome com- 
plementary metal-oxide-semiconductor (CMOS) 
imaging sensor. This is depicted in Fig. 3A. 
Relative to Fig. 2A, the grating is rotated by 45° 
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so that each of its orders forms an image of the 
scene on one quadrant of the imaging sensor. 
Each quadrant then contains a version of the 
photograph analyzed along its characteristic 
polarization—these images can be simultaneously 
acquired and the Stokes vector S reconstructed 
pixel-wise (Fig. 3B), forming a monochromatic 
polarization image. 

Simple ray tracing is used to optimize all as- 
pects of the system: the separation of the grating 
and the asphere, the separation of the asphere 
and the imaging sensor, and finally, the grating 
period (and thus, the diffraction angle 0p). The 
goal of the optimization is to take parallel ray 
bundles over a+5° field-of-view (FOV)—which are 
assumed to emerge from a very distant object— 
and focus them within the bounds of a quad- 
rant of the sensor. An azimuthally symmetric, 
polarization-independent (scalar) phase profile 
is added on top of the matrix grating during the 
design and is experienced by all diffraction orders. 
This produces a weak lensing effect that aids in 
imaging. The design focuses on just one grating 
order. However, the other three will be imaged to 
their respective quadrants by default because the 
system is rotationally symmetric. 

A real ray trace of the imaging system is shown 
in Fig. 3A with a clearer side view in Fig. 3C. The 
ray colors correspond to parallel bundles inci- 
dent at different angles on the matrix grating. 
The optimized grating has N = 10 elements at a 
420-nm pitch, yielding 9p = 7.3° at A = 532 nm. 


Unstressed 
acrylic 


3D Glasses 


Raw 


So 


In Fig. 3D a microscope image of the 1.5-mm 
sample is shown. The sample is surrounded by 
metal to block stray light and displays Fresnel 
zones stemming from the weak lensing imparted 
on top of the grating (whose periodicity is too 
small to see at this magnification). 

Finally, the entire system can be packaged 
into a prototype for practical use in polarization 
photography as shown in Fig. 3E with variable 
focus. The size of this prototype is much larger 
than is strictly necessary to permit easy opto- 
mechanical mounting—as is shown in Fig. 3C, 
the functional part of the system is only ~2 cm 
long. An aperture in front of the camera permits 
control of the FOV so that the four copies of the 
image do not overlap, and a 10-nm bandpass 
filter at 532 nm behind the sample prevents the 
dispersive nature of the grating from interfering 
with imaging. This is important to note — ifa 
broad band of illuminating wavelengths were 
introduced to the imaging system, the grating 
would effectively smear the images together, with 
spatial and spectral information colocated on the 
sensor. The bandwidth of dye filters used in con- 
ventional color sensors (~100 nm for the Bayer 
filter) is too wide to effectively address this. For 
color or hyperspectral polarization imaging, a 
different approach could be employed, such as the 
use of a tunable filter at the input or incorporation 
of the grating into a pushbroom scanning system 
where only one space dimension is acquired at a 
time (as is common in remote sensing). 
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Fig. 5. Polarization imaging of S3. Example imagery from the camera making explicit use of S3, part 
of its full-Stokes capability. All images are acquired with a backlight linearly polarized at 45° In each 
example, raw sensor acquisition, So (traditional intensity image), and S3 are shown. (A) 3D glasses 
are seen to contain opposite circular polarizers, invisible to the traditional intensity image. (B and C) A 
laser-cut acrylic piece is stressed by hand-squeezing and displays stress birefringence evident in the 
S3 image. (D) An injection-molded plastic part (a tape dispenser) has complex, in-built stresses 
that are visible in the S3 channel of the polarization image. Edge artifacts in these images and those 
in Fig. 4 are discussed in the supplementary text (section S4) (17). 
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Polarization imaging 

The metagrating camera can then be used for 
practical full-Stokes photography. The raw sen- 
sor acquisition approximates Fig. 3B. To form a 
polarization image, the four image copies must be 
aligned (registered) to one another, forming the 
vector I = [Ip I In I ie of measured intensity 
from the four quadrants at each pixel. If the po- 
larizations analyzed for by each diffraction order 
are precisely known (calibration), the Stokes vec- 
tor at each pixel can be computed as S = Ay, 
where A is a matrix whose rows are these ana- 
lyzer Stokes vectors. Image registration and polar- 
imetric calibration are addressed simultaneously 
with an angle-dependent method (supplementary 
text section S4) (17). - 

Ina polarization image,S = [So S, Sy S3 ie is 
known at each pixel, which does not admit easy 
visualization. Instead, images can be formed from 
scalar quantities derived from the Stokes vector. 
In the literature of polarization imaging, two pa- 
rameters of the polarization ellipse are commonly 
used. The first is the azimuth angle, arctan(Sy /S,), 
which yields the physical orientation of the po- 
larization ellipse. The second is the degree of 
polarization, given by p = \/S7 + S} + S3/So, 
which quantifies the degree to which light is (or 
is not) fully polarized. Finally, an image can be 
formed of just Sp which, as the intensity of the 
light, yields a traditional monochrome photograph. 

In Fig. 4, exemplar photographs captured by 
the polarization imaging system are shown. For 
each, images of the raw acquisition on the sen- 
sor, So, the azimuth angle, and the degree of 
polarization (DOP) are shown. In all of these 
images, the illuminating light is unpolarized 
and diffuse, that is, incident from all directions. 
The indoor images in Fig. 4 are taken under 
diffuse light-emitting diode illumination, whereas 
outdoor images are acquired in broad daylight. 
In the raw exposures, a bright disk in the cen- 
ter represents zero-order light that does not 
interact with the grating and thus forms a de- 
focused version of the image (some stray light is 
also present). 

Next, we describe each example. Figure 4A 
constitutes a simple test—a paper frame holding 
eight sheets of polarizing film whose axes are 
arranged radially outward with image-forming 
light allowed to transmit from behind. A tra- 
ditional photograph sees no difference between 
the sheets (So), but an image of the azimuth ac- 
curately reveals their angular orientations. More- 
over, the DOP image shows that light passing 
through the sheets is highly polarized relative to 
the surrounding surface. 

Figure 4, B to E, examine the polarization de- 
pendence of specular reflection (47). Unpolarized 
light becomes partially polarized upon specular 
reflection in a direction perpendicular to the 
plane of incidence. In Fig. 4B, a conventional 
plastic soda bottle is imaged head-on. From 
an intensity image (So), the bottle’s curved 
shape could not be ascertained a posteriori. 
The azimuth image, however, displays smooth, 
continuous change around the top of the bottle 
evidencing its conical shape and could be used 
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in three-dimensional (3D) reconstruction; in- 
deed, the polarized nature of specular reflection 
has been used as a means of depth imaging 
(47-50). Figure 4C illustrates the same concept 
for amore complicated object — the face of one 
of the authors. Its 3D nature is not discernible 
from the Sp image (as opposed to, say, a mere 
printed photograph of a face), but the azimuth 
image traces its contour and could be used in 
3D facial reconstruction. 

Figure 4, D and E, depict outdoor scenes. In 
Fig. 4D, a bicycle is seen parked on a grassy 
field after rain on the Harvard campus. In front 
of it is a boundary between grass and asphalt, as 
well as a puddle. In the azimuth image, a strong 
delineation is seen between the wet pavement, 
where the polarization direction is well-defined 
parallel to the ground, and the grass and puddle, 
where the azimuth is somewhat random because 
the reflection is diffuse. Moreover, the azimuth 
reveals the presence of an asphalt walkway in 
the rear of the image, which is difficult to see in 
the intensity image. In Fig. 4E, a row of cars is 
photographed. Cars illuminated by sunlight are a 
favorite target in polarization imaging literature 
because the windshields tend to yield strong 
polarization signatures. This is seen for all three 
cars in the azimuth image, where the windshields 
and auto bodies have definite polarization azi- 
muths and in the DOP image where the wind- 
shields are highly polarized relative to the rest 
of the car and background. 

We note, however, that the examples in Fig. 4 
make only indirect use of S3, the chiral Stokes 
component, through the DOP. Many existing po- 
larization cameras, particularly using the division- 
of-focal-plane approach, are unable to measure 
S3 (42). Figure 5, however, directly demonstrates 
the full-Stokes capability of the camera. In Fig. 5, 
objects are illuminated from behind by light 
linearly polarized at 45° from a liquid crystal 
computer display. In each case, the raw sensor 
acquisition, So, and S3; are shown. Figure 5A de- 
picts a pair of 3D glasses whose frames contain 
opposite circular polarizers. This is not visible 
in the traditional intensity image So, but is 
readily seen in the S3 image where each lens has 
a+ value. In Fig. 5B, a piece of acrylic is held in 
front of the camera, displaying little chiral com- 
ponent. Upon squeezing (Fig. 5C), no change is 
perceptible in the traditional photograph, but 
considerable information is now visible in S3 
stemming from stress birefringence (the photo- 
elastic effect). This is also evident in Fig. 5D 
where an injection molded plastic part—a tape 
dispenser—displays an in-built, complicated 
distribution of stress in S3 not visible with tra- 
ditional photography. Visualization of stress fields 
is an important application of polarization imag- 
ing, one that is readily accomplished with the 
camera presented here. 

These examples demonstrate powerful appli- 
cations in machine vision, remote sensing, and 
other areas. The camera presented here requires 
no conventional polarization optics, offers simul- 
taneous data acquisition with no moving parts, 
does not require an imaging sensor specially 
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patterned with micropolarizers, and is compact 
and potentially mass-producible. Moreover, the 
camera’s simplicity—just a single grating with an 
imaging lens—suggests that these gratings could 
be designed around existing, conventional imag- 
ing systems to create polarization-sensitive ones. 


Conclusion 


We have introduced a general picture of polarization- 
dependent diffraction from polarization-sensitive 
obstacles. This matrix Fourier optics describes 
optical elements that can enact many polarization- 
dependent functions in their diffraction patterns. 
In this work, we applied this picture to the case 
of periodic gratings analyzing arbitrary polar- 
izations in parallel, characterized these gratings, 
and showed how they enable a polarization cam- 
era with no additional polarization optics, moving 
parts, or specialized sensors. Practical polarization 
photography with this camera was demonstrated. 

Our work generalizes a large body of research 
in polarization-sensitive diffractive optics and 
metasurfaces. Its emphasis is on the collective 
behavior of many elements at once as expressed 
in the Fourier transform, rather than the point- 
by-point consideration of each element alone, 
and thus advances design strategies in these 
areas beyond simple phase profiles. Moreover, 
this work suggests several interesting directions 
of research in multifunctional polarization op- 
tics. For instance, although polarization analyzers 
(polarizers) have been studied here, the matrix 
approach is quite general: Gratings implement- 
ing a wide variety of polarization operators on 
their orders [e.g., waveplates, optically active 
media, and possibly more exotic behavior (57)] 
could be envisioned. The constraint of linearly 
birefringent elements is by no means fundamen- 
tal, and with the freedom afforded by lithographic 
fabrication, elements with more complex polar- 
ization responses (52) or multilayer structures 
could be used to realize these behaviors. This 
work illustrates the ability of metasurfaces to 
markedly simplify the architecture of systems 
using polarization optics. Gratings enabled by 
the approach here present a simpler means of 
full-Stokes polarization imaging that can be eas- 
ily extended to other imaging systems and wave- 
lengths. These compact, lightweight, and passive 
devices could enable the widespread adoption of 
polarization imaging. 
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INTRODUCTION: Cattle, sheep, and goats 
appeared in eastern Africa 5000 years ago, 
catalyzing the spread of herding through- 
out sub-Saharan Africa. Archaeologists have 
long debated the geographic origins of east- 
ern Africa’s first herders, the extent to which 
people moved with livestock, and relationships 
among food-producing and foraging commu- 
nities. In this work, we integrate ancient DNA 
with archaeological, linguistic, and genetic evi- 
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African groups 
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Ancestry related 
to recent 
Sudanese groups 


Ancestry related to 
recent North African 
& Levantine groups 


Ancestry related to 
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Admixture 


: early herders: 


dence to explore how pastoralism developed 
within this region, establishing the roots of one 
of Africa’s dominant economic strategies. 


RATIONALE: Research into the spread of herd- 
ing has been limited by patchy archaeological 
data and poorly preserved human remains. 
Ancient DNA has the potential to untangle 
patterns of movement and interaction under- 
lying this economic and cultural transition. 


West African 
ancestry in 
early farmers 


Increased Sudanese : 
ancestry inherders : 


oi 


Genetic clustering 
despite cultural variation 


events in 


Relative genetic isolation of foragers 


East African foragers; : i : : 
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Admixture events contributing to ancestry of ancient eastern Africans. Results were 


inferred from genome-wide ancient DNA data from 41 individuals from archaeological sites in 
Kenya and Tanzania, analyzed together with published ancient and present-day genetic data. 
Black circles represent reported individuals, placed at their median calibrated radiocarbon 
dates (six individuals, five of whom have forager-related ancestry, had insufficient collagen for 
dating and thus are not represented here). Ancestry components depicted in green and gray 
continue to the present day (outside of eastern Africa) but are truncated for readability. 
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We generated genome-wide ancient DNA data 
from the remains of 41 individuals (35 direct- 
ly radiocarbon dated) associated with Later 
Stone Age (n = 3), early pastoral and Pastoral 
Neolithic ( = 31), Iron Age (n = 1), and Pas- 
toral Iron Age (n = 6) traditions in what are 
now Kenya and Tanzania to study how an- 
cient individuals were related to each other 
and to people living today. 


RESULTS: We document a multistep spread 
of herding and farming into eastern Africa. 
Ancient individuals genetically correlate with 
their archaeological associations: Later Stone 
Age individuals form part of a forager genetic 
cline, early pastoral and Pastoral Neolithic in- 
dividuals are most close- 
ly related to present-day 
Read the full article Afro-Asiatic speakers, and 
at http://dx.doi. Pastoral Iron Age indi- 
org/10.1126/ viduals show affinities to 
science.aaw6275 present-day Nilotic speak- 
slitentatedatietinntioatas bee Avelaldl Gaiea ot ae 
Iron Age agricultural site has shared ancestry 
with western Africans and Bantu speakers. 

We propose a four-stage model that fits the 
data. First, admixture in northeastern Africa 
created groups with approximately equal pro- 
portions of ancestry related to present-day 
Sudanese Nilotic speakers and groups from 
northern Africa and the Levant. Second, de- 
scendants of these northeastern Africans mixed 
with foragers in eastern Africa. Third, an addi- 
tional component of Sudan-related ancestry 
contributed to Iron Age pastoralist groups. 
Fourth, western African-related ancestry, sim- 
ilar to that found in present-day Bantu speak- 
ers, appeared with the spread of farming. 

We also observe a high frequency of a Y chro- 
mosome lineage associated with the spread of 
pastoralism, as well as a single individual with 
a genetic variant conferring adult lactase per- 
sistence. We do not detect any differentiation 
among individuals associated with two distinc- 
tive Pastoral Neolithic artifact traditions, sug- 
gesting that these represent cultural rather 
than ancestral differences. 


CONCLUSION: Archaeological and now ge- 
netic evidence suggest complex spreads of 
herding and farming in eastern Africa in- 
volving multiple movements of ancestrally 
distinct peoples as well as gene flow among 
these groups. Models formulated on the basis 
of ancient DNA are a starting point for further 
exploration through additional archaeological, 
linguistic, and genetic research. 
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How food production first entered eastern Africa ~5000 years ago and the extent to which 
people moved with livestock is unclear. We present genome-wide data from 41 individuals 
associated with Later Stone Age, Pastoral Neolithic (PN), and Iron Age contexts in what are 
now Kenya and Tanzania to examine the genetic impacts of the spreads of herding and 
farming. Our results support a multiphase model in which admixture between northeastern 
African—-related peoples and eastern African foragers formed multiple pastoralist groups, 
including a genetically homogeneous PN cluster. Additional admixture with northeastern 
and western African-related groups occurred by the Iron Age. These findings support several 
movements of food producers while rejecting models of minimal admixture with foragers 
and of genetic differentiation between makers of distinct PN artifacts. 


omestic sheep, goats, and cattle of south- 

west Asian origin were first introduced 

to northeastern Africa ~8000 calibrated 

years before the present (B.P.) and spread 

into eastern Africa beginning ~5000 B.P., 
ultimately reaching southernmost Africa by 
~2000 B.P. (1, 2). How pastoralism—a way of 
life centered on herding animals—spread into 
eastern Africa is unclear. Livestock appear in 
northern Ethiopia and Djibouti relatively late 
[~4500 to 4000 B.P. (3)] and are poorly docu- 
mented elsewhere in the Horn of Africa and in 
South Sudan. Instead, the earliest known domes- 
ticated animals in sub-Saharan Africa are found 
in Kenya at the beginning of the Pastoral Neo- 
lithic (PN) (~5000 to 1200 B.P.) era near Lake 
Turkana, where archaeological evidence docu- 
ments groups that pursued fishing and herding 
and constructed elaborate monumental ceme- 
teries (4-6). Although livestock spread quickly 


through the Turkana Basin, herding practices 
were not transmitted farther south for many 
hundreds of years. Sheep, goats, and pottery 
typical of Turkana began to trickle into Kenya’s 
south-central Rift Valley ~4200 B.P. (7, 8), but it 
was not until ~3300 B.P. that specialized pastoral- 
ism spread across Kenya and northern Tanzania, 
transforming the economic, social, and physical 
landscapes of the region (9-11). 

The core PN era (~3300 to 1200 B.P.) in Kenya 
and Tanzania witnessed the development of di- 
verse herder societies, some heavily reliant on 
livestock (2). However, pastoralism did not fully 
replace Later Stone Age (LSA) economies present 
in the region since ~50,000 B.P., creating a mosaic 
of herding and foraging communities on the 
landscape. Two contemporaneous pastoralist 
traditions have been identified: Elmenteitan 
and Savanna Pastoral Neolithic (SPN) (72, 13). 
Elmenteitan sites are found between the central 


Rift Valley and the western Lake Victoria Basin 
of Kenya. Occupants used a particular obsidian 
source, left behind distinctive lithic and ceramic 
traditions, and practiced primarily cremation 
burial. By contrast, SPN sites are found across 
a wider part of Kenya and Tanzania. Occupants 
used different obsidian sources, had greater 
diversity in material culture, and mainly buried 
their dead in cairns. The heterogeneous SPN 
category likely encompasses multiple groups. 
Some distinctions between SPN and Elmenteitan 
traditions, such as mortuary practices, are varia- 
ble (6), and relationships between PN groups— 
both cultural and genetic—remain uncertain. In 
addition, little is known about herder interac- 
tions with LSA foragers or about relationships 
among later PN herders and the first iron-using 
herders after ~1200 B.P. By this time, farming is 
also documented in the region (/4, 15). 

Archaeologists have debated the cultural and 
genetic affinities of the first pastoralists in east- 
ern Africa and the role that movement of people 
played in the spread of herding to the region. Be- 
cause the oldest instances of livestock remains 
and associated pottery and stone tool traditions 
have been found near Lake Turkana, it has been 
hypothesized that pastoralism was introduced by 
migrants from Sudan and/or Ethiopia, poten- 
tially in a series of small movements, and that 
their descendants gave rise to PN traditions far- 
ther south (72, 13, 15, 16). However, there are no 
unambiguous cultural connections between 
Kenya’s earliest herders and northern groups, 
and archaeological evidence supports the local 
adoption of herding to some degree (8, 16, 17). 
Other archaeological and linguistic evidence 
has been jointly used to hypothesize two ex- 
pansions into eastern Africa: an initial expansion 
of herders speaking Afro-Asiatic (specifically 
proto-Southern Cushitic) languages from the 
Horn of Africa linked with the SPN, and a sec- 
ond expansion of herders speaking Nilo-Saharan 
(specifically Nilotic) languages linked with the 
Elmenteitan. 

People of the second expansion have also been 
hypothesized to be ancestral to some Iron Age 
groups (18, 19). One subset of Rift Valley sites is 
designated Pastoral Iron Age (PIA) (~1200 B.P. to 
recent years) on the basis of material culture and 
evidence for herding, whereas other sites appear 
connected to farming and are classified into 
early, middle, and later Iron Age (IA) (~2500 B.P. 
to recent) variants (2, 14). Iron-working first 
entered eastern Africa via the Lake Victoria 
Basin ~2500 B.P. and spread toward the coast by 
2000 B.P. (74). This expansion may have brought 


{Division of Humanities, Saint Louis University, 28003 Madrid, Spain. Department of Genetics, Harvard Medical School, Boston, MA 02115, USA. “Department of Anthropology, Stony Brook 


1 


University of California, Santa Cruz, CA 95064, USA. 


Prendergast et al., Science 365, eaaw6275 (2019) 


5 July 2019 


University, Stony Brook, NY 11790, USA. “Department of Earth Sciences, National Museums of Kenya, Nairobi, Kenya. “Howard Hughes Medical Institute, Harvard Medical School, Boston, MA 
02115, USA. “Institutes for Energy and the Environment, Pennsylvania State University, University Park, PA 16802, USA. ‘Department of Anthropology, Pennsylvania State University, University 
Park, PA 16802, USA. “Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA. °Department of Anthropology, University of Illinois Urbana-Champaign, Urbana, IL 61801, USA. 
Department of Public and Community Health, School of Health Sciences, Jaramogi Oginga Odinga University of Science and Technology, Bondo, Kenya. “Hutchins Center for African and 
African American Research, Harvard University, Cambridge, MA 02138, USA. *National Museums of Tanzania, Dar es Salaam, Tanzania. Livingstone Museum, Livingstone, Zambia. “Academy 
or Lifelong Learning, Western Washington University, Bellingham, WA 98225, USA. “Department of Anthropology, California State University, San Bernardino, CA 92407, USA. “Department of 
Anthropology, University of California, Santa Barbara, CA 93106, USA. 
*Corresponding author. Email: mary.prendergast@slu.edu (M.E.P.); mlipson@genetics.med.harvard.edu (M.L.); elizabeth.sawchuk@stonybrook.edu (E.A.S.); reich@genetics.med.harvard.edu (D.R.) 

These authors contributed equally to this work. {Present address: Department of Anthropology, University of California, Santa Cruz, CA 95064, USA. §Present address: Department of Biomolecular Engineering, 


1of10 


RESEARCH | RESEARCH ARTICLE 


UGANDA 


Lake bs 
Victoria @ 6 


TANZANIA 


Lake 20 
Eyasi x 


ETHIOPIA 


KENYA 


400km 


VOVNOS 


5000 


4000 
Years before present (BP) 


oe’ : 
L. Ol Bolossat 
‘e; a 
L. Nakuru , io 
L. Elmenteita “aps 
@14 Bas 
A'16 
Mt. Roary @17 
L. Naivasha 


0 10 20 40 50km_ 


3000 2000 1000 0 


0 100 200 300 
A 
LEGEND 
@ Later Stone Age site 
% Early pastoral site x 


@ Pastoral Neolithic site 


@ Pastoral Neolithic/Elmenteitan site 
Iron Age site 
A Pastoral Iron Age site 


A Possible Pastoral Iron Age site 
@ Pastoral Neolithic & Pastoral Iron Age sites 
© Population center 
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early IA farmers, thought to have spoken Bantu 
languages originating in equatorial western 
Africa, into contact with PN herders, although 
iron-working is not widely attested among 
herders until ~1200 B.P. at PIA sites (2, 15). Alter- 
natively, PIA sites may reflect other iron-working 
traditions entering from the north, potentially 
associated with additional movements of Nilotic- 
speaking pastoralists into and within the Rift (2). 
This complex mosaic of foragers, herders, and 
farmers gave rise to much of the present day ethno- 
linguistic landscape of eastern Africa (20). 
Rigorous testing of models for the spread of 
herding has been inhibited by several factors. A 
spatial and chronological gap exists between the 
earliest evidence of pastoralism in the Turkana 
Basin and the later PN expansion, with few mate- 
rial culture similarities. Additionally, relationships 
among PN and diverse Iron Age groups remain 
poorly understood. Human skeletal material from 
relevant contexts tends to be fragmentary, limit- 
ing bioarchaeological analysis, and reliable radio- 
carbon dates are rare. Finally, the persistence of 
foraging groups raises questions about interac- 
tion networks during this period (72, 15), as well 
as whether food production spread primarily via 
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demic expansion or via local adoption of novel 
practices and livestock. 

To address these debates, we generated genome- 
wide ancient DNA data from individuals buried 
at sites associated with LSA (n = 3), early pas- 
toral and PN (n = 31), IA (n = 1), and PIA (n = 6) 
archaeological traditions in what are now Kenya 
and Tanzania (Fig. 1, Table 1, and table S1). We 
extracted DNA from a combination of tooth and 
bone samples and enriched for a targeted set of 
~1.2 million single nucleotide polymorphisms 
(SNPs) (27). Surprisingly, given the tropical cli- 
mate and variable curatorial conditions, we ob- 
tained excellent data quality, with a median of 
approximately 0.51x coverage, or 440,000 SNPs 
covered by at least one sequence, for the 41 newly 
reported individuals (from a total of 67 sequenc- 
ing libraries; table S2). The data scored well in 
standard ancient DNA authenticity metrics for 
all but two individuals [112391 and 113970, whom 
we excluded from genome-wide analyses but for 
whom we obtained Y chromosome and mito- 
chondrial DNA (mtDNA) haplogroups (21)]. We 
also generated direct radiocarbon dates for 
35 individuals (tables S3 and S4 and fig. S1). We 
analyzed these data jointly with sequences from 


5 July 2019 


published ancient African individuals (22-27) as 
well as from people living in eastern Africa today 
(28-3D. 


Overview of genetic affinities of ancient 
eastern Africans 


We used principal components analysis (PCA) 
of the genome-wide data to visualize the genetic 
structure of the ancient individuals (Fig. 2 and 
table S5). We defined PCs using a small set of 
present-day groups [southern Africans, north- 
eastern Africans, and non-Africans (27)] and pro- 
jected a large number of diverse individuals onto 
these axes. An alternative analysis with western 
Africans additionally used to compute the axes 
yielded almost identical results (fig. S2). Present- 
day groups from Sudan mostly lie along a cline 
extending from Copts (upper right, near indi- 
viduals from northern Africa and the Levant) to 
Nilotic speakers such as Dinka and Nuer (lower 
left). Afro-Asiatic speakers (mostly from Ethi- 
opia) form a second cline, with the right end near 
Sudanese Beja and Nubians and the left end 
extending toward eastern African foragers [who 
themselves form a south-to-north gradient (22)]. 
Present-day Kenyans largely fall in the space 
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Fig. 2. Principal components analysis. Shaded regions are drawn to highlight notable clines of ancestry. Ancient individuals from this study are 
indicated in the key with asterisks. Dates for published ancient individuals outside of Kenya and Tanzania are ~4500 B.P. for Mota, ~8100 to 2500 B.P. for 
Malawi foragers, ~6500 to 5800 B.P. for Israel Chalcolithic, and ~3600 to 2000 B.P. for Egypt. ELM, Elmenteitan; LSA, Later Stone Age; PN, Pastoral 
Neolithic; IA, lron Age (18802); PIA, Pastoral Iron Age; ENP, early northeastern pastoralist. See (21) for more details and table S5 for the full list of 


individuals shown. 


between Sudanese, Ethiopians, and western 
Africans, with their language family affiliations 
tending to predict their broad-scale genetic 
affinities. 

The positions of ancient eastern Africans on 
the PCA strongly correlate with archaeological 
associations. The three individuals from LSA 
cultural contexts all cluster with previously re- 
ported ancient foragers, falling intermediately 
between those from southern Ethiopia (Mota) 
and coastal Tanzania (Zanzibar and Pemba is- 
lands), consistent with their geographic position 
(22, 24). Individuals from pastoralist contexts 
[including one from Luxmanda in Tanzania (22)] 
are highly differentiated from foragers, with 
the exception of three individuals uncertainly 
assigned a pastoral context at Gishimangeda 
Cave in Tanzania, who cluster with foragers. PN 
individuals, including Elmenteitan and those 
within the heterogenous SPN category (whom 
we refer to as “other PN”), mostly form a tight 
cluster near present-day Afro-Asiatic speakers, 
with a small number of modest outliers, includ- 
ing the two individuals buried at Prettejohn’s 
Gully, whose earlier date (~4000 B.P.) coincides 
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with the initial limited spread of herding into 
the area. Finally, five Iron Age individuals are 
shifted to the left in the PCA: four PIA indi- 
viduals toward Nilotic speakers and an IA child 
from Deloraine Farm (I8802)—the earliest agri- 
cultural site in Kenya’s Rift Valley (32)—toward 
western Africans and Bantu speakers. 

We also examined the uniparentally inherited 
loci (mtDNA and Y chromosomes) of the sam- 
pled individuals. The most pronounced pattern 
is the high frequency among the PN individuals 
(7 to 12 out of 17 males; table S6) of the E-M293 
haplogroup (Elb1b1b2b2a1), a Y chromosome 
lineage that has been hypothesized to be asso- 
ciated with the spread of pastoralism in the 
Horn of Africa, Kenya, and Tanzania and from 
there to southern Africa, on the basis of its 
present-day distribution and diversity (33, 34). 
Other males also carried haplogroups most fre- 
quently found in present-day eastern Africa, with 
the exception of E-M58 (Elblalalala; predom- 
inantly western African) in the IA individual 
18802, consistent with his position in the PCA. 
The observed mtDNA lineages form more of a 
mosaic pattern, including types most closely asso- 
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ciated with eastern and northeastern Africans, 
eastern African foragers, and northern Africans 
and western Eurasians (table $7). 


Formal modeling of admixture 


To obtain quantitative inferences about the ge- 
netic relationships among the ancient and present- 
day individuals, we used the gpAdm software 
(35, 36), which provides a flexible framework 
for testing admixture models and estimating 
mixture proportions. Guided by the PCA, we 
began by using three groups of individuals— 
present-day Dinka (28), ancient Chalcolithic- 
period individuals from Israel (25), and the 
~4500 B.P. forager from Mota, southern Ethiopia 
(24)—to represent distinct components of ances- 
try plausibly found in ancient and present-day 
eastern Africans, with present-day western Africans 
among the outgroups (27). Note that the use of 
these proxy groups in gpAdm modeling does 
not imply an assumption that they are directly 
ancestral to the true sources contributing to the 
individuals we analyzed. Instead, for a model to 
be properly formulated, the reference groups only 
need to be more closely related to the true sources 
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than are the outgroups, without substantially dif- 
ferent admixture (35). Thus, for example, ancestry 
related to the Chalcolithic Israel reference individ- 
uals could plausibly have originated anywhere in 
northeastern Africa or the Levant and could have 
been present in northeastern Africa for many 
thousands of years. We use the Chalcolithic in- 
dividuals in this study because we lack genetic 
data from a phylogenetically adjacent reference 
group from Egypt, Sudan and/or South Sudan, 
or the Horn. Additionally, gqaAdm does not re- 
quire any assumptions regarding the internal 
phylogeny relating the references and outgroups, 
and it provides both standard errors for mixture 
proportion estimates and a P value for overall 
model fit quality (35). 

Our gpAdm modeling reveals that the PN 
individuals had substantial proportions of all 
three ancestry components (~40% each for those 
represented by Dinka and by the Chalcolithic 
Israel individuals and ~20% related to Mota) 
(Fig. 3 and tables S8 and S9), with no evidence 
of western African-related ancestry. The indi- 
viduals from Prettejohn’s Gully can also be well 
modeled using the same three components, but 
in a modestly different ratio. The Iron Age group 
as a whole (including the more recent ~300 B.P. 
individual from Emurua Ole Polos but excluding 
the possible PIA individual from Kokurmatakore) 
does not fit well under a three-way model, but 
the fit improves markedly when we exclude the 
Deloraine Farm individual I8802 (P = 0.009 
versus 0.0003). The remaining four individuals 
(who are confidently assigned to PIA contexts) 
are inferred to have substantially more Sudan 
(Dinka)-related ancestry (~60%) than is seen 
in the PN. We also observe similar patterns for 
present-day groups falling near the ancient in- 
dividuals in PCA [using data from (3)], whereby 
the three-way model fits better for Afro-Asiatic- 
and Nilo-Saharan-speaking groups than for Bantu- 
speaking groups (table S8). Consistent with the 
PCA results, Afro-Asiatic speakers are inferred 
(as in PN) to have relatively even proportions of 
the components represented by Dinka and by 
Chalcolithic Israel (but with varying proportions 
of Mota-related ancestry), whereas Nilo-Saharan 
speakers are inferred to have more Sudan-related 
ancestry. Alternative model formulations in which 
we use either ancient individuals from Taforalt in 
Morocco (27) in place of the Chalcolithic Israel 
group or present-day Lemande from Cameroon 
(28) in place of Dinka (with Dinka moved to the 
outgroup set) fit significantly worse for most test 
groups (table S8). 

From these results, we formulated a four-part 
hypothesis to explain the origins of the ances- 
tries in the sampled eastern African groups. First, 
admixture in northeastern Africa, likely asso- 
ciated with the spread of pastoralism, created 
groups (as yet unsampled with ancient DNA) 
with approximately equal proportions of ances- 
try related to (i) present-day Nilotic speakers 
such as Dinka and Nuer and (ii) sampled an- 
cient and present-day groups from northern 
Africa and the Levant. We refer to this combi- 
nation as early northeastern African pastoralist- 
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associated ancestry (henceforth “early north- 
eastern pastoralist,” or ENP) and the two sub- 
components as EN1 and EN2. Second, descendants 
of these groups mixed with local foragers in 
eastern Africa, leading to the ~20% Mota-related 
ancestry in the PN individuals. Third, an addi- 
tional period of Sudan-related gene flow oc- 
curred before the Iron Age and contributed to 
PIA groups. Fourth, close to the same time, 
western African-related ancestry related to 
present-day Bantu speakers [also seen in an 


Northeastern African-related 
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individual buried on Pemba Island ~600 B.P. 
(22)] appeared in the Rift Valley (notably at 
Deloraine Farm), in association with the spread 
of farming. 

To test these hypotheses and gain further 
insight into changes in ancestry over time, we 
carried out a second round of analysis in goaAdm 
using pairs of reference groups linked more 
closely with each historical phase. For the ini- 
tial spread of pastoralism, we used Hadendowa 
[Sudanese Beja (29)] plus Mota. It is likely that 


E. African 
forager-related 


W. African- 
related 


Kikuyu Luhya lraqw 
(Kenya, (Kenya, (Tanzania, 
NS) BA) BA) AA) 


Fig. 3. Proposed model of admixture for ancestry in eastern Africans. Solid-color bars 
represent lineages of northeastern African (EN1/Sudan-related in green, EN2 in gray), 

eastern African forager-related (orange), and western African-related (blue) ancestry, and 
mixed-color bars represent admixed groups (hypothesized early northeastern pastoralists, green 
plus gray). Pie charts show ancestry proportions for sampled ancient (embedded in figure 

at approximate date points) and present-day (bottom) groups inferred from qpAdm (PN-related 
ancestry as mixed-color sections). Black arrows represent likely ongoing interactions and not 
specific admixture events inferred from the data. EN1 and EN2, early northeastern pastoralist 
source groups; PN, Pastoral Neolithic; ELM, Elmenteitan; PIA, Pastoral Iron Age; IA (DF), Iron Age 
(Deloraine Farm); AA, Afro-Asiatic; NS, Nilo-Saharan; BA, Bantu. 
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the genetic landscape of northeastern Africa 
has changed substantially since the time of the 
events we are modeling, so we do not propose 
that Hadendowa are descended directly from 
ENP; rather, we chose them to serve as a proxy 
for the (approximate) mixture of ancestries hy- 
pothesized to be present in the true ENP-related 
source (on the basis of the PCA and qpAdm 
results above). This two-way model yields a 
good fit for the PN individuals (P = 0.45) but not 
for Iron Age individuals (either the PIA cluster 
or the IA individual from Deloraine Farm) or 
present-day Nilotic- and Bantu-speaking groups 
(all P < 1 x 10°; table S8). We also attempted 
to fit PN as a mixture of possible early Kenyan 
pastoralist (represented by Prettejohn’s Gully) 
and forager-related ancestry, but this combi- 
nation was rejected (P < 1 x 10°° using either 
Mota or the three Kenyan LSA individuals to 
represent the forager component), suggesting 
that the two ancient pastoralist groups are not 
simply differentiated by their proportions of 
forager-related ancestry. 

Finally, to study later transformations, we 
built models using PN as one proxy source and 
Dinka (Sudan-related), Mota (forager-related), 
or Lemande (western African-related) as the 
other. We obtain improved fits for the Iron 
Age individuals and for present-day Kenyan 
Nilotic- and Bantu-speaking groups in this 
framework: The PIA cluster can be fitted as a 
mixture of ~57% PN-related and ~43% Sudan- 
related ancestry, whereas the Deloraine Farm 
individual can be modeled as a mixture of ~29% 
PN-related and ~71% western African-related 
ancestry (Fig. 3). Similar models also yield good 


fits for present-day Maasai (~47% PN-related 
and ~53% Sudan-related) and Kikuyu (~40% 
PN-related and ~60% western African-related), 
whereas Luhya can be fit as a mixture of Sudan- 
related (~41%) and western African-related (~59%) 
ancestry (Fig. 3). 

We also used direct tests of asymmetry in 
allele frequencies to investigate the fine-scale 
genetic structure of the PN cluster and related 
individuals (table S10). First, in agreement with 
their colocalization in PCA, we do not detect any 
significant differences in allele-sharing between 
Elmenteitan and other PN individuals relative 
to a set of 27 comparison groups, including the 
Iron Age and possible early pastoralist groups 
from this study (maximum nominal Z score = 
2.1; see also Fig. 4). There are hints of dif- 
ferentiation (maximum Z = 2.6) between the 
main PN cluster and individual 18904 from 
Kokurmatakore [previously dated to the PIA 
(37)], but this individual’s ancestry is much 
more similar to other PN individuals than to 
PIA (Fig. 2 and table S10). We also find only 
minor asymmetry between the primary Kenyan 
and Tanzanian PN clusters (maximum Z = 2.5). 
However, four PN-period individuals who appear 
as outliers on PCA do have statistically signifi- 
cant ancestry differences as compared with the 
PN cluster. In particular, two individuals from 
Gishimangeda Cave (113972 and 113978) and the 
previously reported ~3100 B.P. pastoralist indi- 
vidual from Luxmanda in Tanzania (22) have 
evidence of more or different forager-related 
ancestry relative to Sudan-related ancestry [e.g., 
f,{Ancient South African foragers, Dinka; X, 
PN) > 0, Z = 3.2, 5.1, and 6.8, respectively; Fig. 4 


100% 
90% i 
80% 
3 LE ff ge eg ee a ee a 
70% 
60% 
50% 
40% 
30% 
20% 
10% 
0% . } 

YD HWS QD BD PB 9 DO SD DA PDS) QP PS A) 
Ss. Re o. a & i & i o & & & & a ie & = < & o oe ow & & ie e < 
PG AV ad A TGV nD DAD TAD TAD TAD AD My HE Hy TD DD =D ~ a “A 
Pod YE OSS SAY RS OP GH SOP Ne SP NO AP DEH LP AP PP _F \' 
SV gS 0D oP PD? PAW DE PP PPP PPP VP? PP DD PP PVP SF PP 
SS Se Oe ee ae ae ee 

Vv 


» ENP-related 


u Eastern African forager-related 


Fig. 4. Mixture proportions for PN individuals. Results are from a two-component qpAdm model 
using Hadendowa (green-and-gray striped bars) and the ancient individual from Mota, Ethiopia 
(orange bars) as proxy sources [for early northeastern pastoralist (ENP) and eastern African 
forager—related ancestry, respectively]. Radiocarbon-dated individuals (to the right of the solid line) 
are ordered from most ancient on the right (18874, 3350 to 3180 B.P.) to most recent on the left 
(112394, 1530 to 1400 B.P.). Bars show two standard errors in each direction. The dashed line 
represents the Kenya PN group-level estimate (74.7 + 10% ENP-related ancestry). Note that the 
linear regression coefficient for forager-related ancestry is not significantly nonzero as a function of 
date (R* = 0.03, P = 0.39) or latitude (R* = 0.03, P = 0.37). ELM, Elmenteitan; PN, other Kenyan 
Pastoral Neolithic; TA, Tanzanian PN [including the Luxmanda individual from (22)]; unc., uncertain. 
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and tables S8 and S10], whereas individual I8759 
(an early PN individual buried at Naishi Rock- 
shelter in Kenya) has evidence of less forager- 
related ancestry [e.g., f,(Ancient South African 
foragers, Europeans; I8759, PN) < 0, Z = —4.4]. 
We also confirm the differences in ancestry be- 
tween the PN cluster and the two possible early 
pastoralist individuals from Prettejohn’s Gully 
(both Z > 5). Although the individuals from 
Prettejohn’s Gully fall relatively far apart on 
PCA (Fig. 2), their ancestry is only weakly dif- 
ferentiated via f statistics (maximum Z = 2.2; 
table S10). 


Dates of admixture 


The fact that we observe tight clustering of PN 
individuals via PCA and gpAdm, with little if any 
spatial or temporal structure as revealed by di- 
rect dating (Fig. 4 and table S9), suggests that the 
admixture responsible for forager-related ances- 
try in the PN had largely ceased before the life- 
times of our sampled individuals. To test this 
hypothesis, we used the MALDER software 
(38, 39) to estimate dates of admixture for pairs 
of high-coverage individuals with similar direct 
radiocarbon dates and locations (27). All pairs 
have inferred dates that point to distant average 
times of admixture (mean ~4600 B.P. for PN 
and ~5300 B.P. for Prettejohn’s Gully; Fig. 5 and 
table S10), with the concordance among the PN 
estimates providing an independent line of evi- 
dence for a lack of substantial ongoing mixture. 
We infer a more recent average date (~2200 B.P.) 
for two late PIA individuals, likely associated 
with additional Sudan-related ancestry (table S11). 
Our power to detect multiple waves of admix- 
ture is limited with ancient data, but for one 
pair of PN individuals from Naivasha Burial Site, 
we are confidently able to identify two separate 
events, the first at ~5100 B.P. and the second at 
~4000 B.P. We also infer two waves for a pair of 
individuals from Gishimangeda Cave, dating to 
~6000 B.P. and ~4000 B.P. In light of our gpAdm 
results, and given the associated MALDER am- 
plitudes (table S11), these multiple dates plausibly 
represent estimates of the times of (i) the for- 
mation of admixed ENP ancestry and (ii) admix- 
ture in eastern Africa between local foragers and 
descendants of the first mixture, leading to the 
three-component ancestry of PN individuals. In 
this context, the single (and intermediate) esti- 
mated dates for other PN pairs can be interpreted 
as averages of these two processes (Fig. 5). 


Incorporating genetic evidence 
into models for the spread 
of food production 


The four-phase model emerging from our ge- 
netic and radiocarbon dating results builds on 
archaeological reconstructions for the spread 
of herding into eastern Africa, supporting some 
theories while rejecting others that until now 
have been considered plausible. Under a proposed 
“moving frontier” model, herders entering new 
environments would interact in diverse ways with 
Indigenous foragers, resulting in varying cultural 
responses and blurred archaeological boundaries 
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Fig. 5. Dates of admixture inferred for pairs of ancient individuals. Bars show two standard 
errors in each direction. The shaded areas represent implied periods of admixture (Adm.): ENP (early 
northeastern pastoralist; red), forager-related (blue), and additional Sudan-related (green). Early P, 
early pastoralists; PN, Pastoral Neolithic; PIA, Pastoral Iron Age; TA, Tanzanian PN. See also table S11. 


as groups adopted some of each other's cultural 
practices. In the ensuing “static frontier,” more 
intensive herding and/or competition would 
transform initial relationships into more stable, 
long-term patterns (40). Archaeologists interpret 
the construction of cemeteries by the first herders 
in the Turkana Basin and the apparent trickle 
of people with similar material culture into the 
south-central Rift Valley as part of a moving 
frontier, whereas the explosion of pastoralist cul- 
tures in the PN reflects more established, static 
herder-forager relationships (6, 15). 

On the basis of genetic data from a wide 
sampling of PN individuals, we infer two phases 
of admixture associated with the spread of pas- 
toralism: the first likely ~6000 to 5000 B.P. in 
northeastern Africa and the second ~4000 B.P. 
between this admixed ENP group and eastern 
African foragers. Archaeological data show that 
the Nile Valley was important for herders seek- 
ing reliable water sources toward the end of the 
African Humid Period (~’7000 to 6000 B.P.) (1, 41). 
Herders plausibly traced the White Nile south- 
ward, following unknown trajectories through 
South Sudan and/or southern Ethiopia to arrive 
in the Turkana Basin ~5000 B.P. (5, 6). Alterna- 
tively, they may have moved via the Horn of 
Africa, but current evidence for herding in that 
region postdates that of Turkana (3). Our results 
support archaeological hypotheses that no mat- 
ter the routes they took, early herders interacted 
with local foragers as they spread (16, 42). In 
eastern Africa, extensive forager-herder interac- 
tions have been proposed, both in the Turkana 
Basin and during the initial trickle of herding 
into the south-central Rift Valley (6-10, 16, 17). 
Either area, or another unsampled region, could 
have witnessed the admixture we document be- 
tween descendants of the (already admixed) ENP 
group and local foragers, giving rise to the groups 
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who then developed the PN cultural traditions of 
southern Kenya and northern Tanzania. 

Our attempts to extract DNA from early herd- 
ers in Turkana were unsuccessful (27), so the 
genetic ancestry (or ancestries) of the first eastern 
African pastoralist groups remains uncertain. 
However, some lineages may be reflected in a 
man and a woman buried at Prettejohn’s Gully 
~4000 B.P. There are few associated artifacts, 
but the individuals’ genetic profiles suggest that 
they may represent an initial limited dispersal 
of herders into the south-central Rift Valley that 
did not leave large numbers of descendants. Pre- 
viously, evidence of herding before ~3300 B.P. 
was limited to Turkana-related Nderit pottery 
found sporadically, and usually undated, in the 
area (8), and to sheep or goat remains associ- 
ated with a date of ~4200 B.P. at a site 33 km 
south of Prettejohn’s Gully (7). Genetically, the 
two individuals are most similar to those from 
PN sites, but they fall outside the range of sam- 
pled PN (and present-day) variation and cannot 
be modeled as directly related to PN. They also 
have a suggestively older date of admixture, and 
the male individual (112533) carries a Y chromo- 
some haplogroup (E2; E-M75) not found in any of 
our sampled PN individuals. Our results thus 
imply at least two chronologically distinct move- 
ments of herders through eastern Africa, consist- 
ent with archaeological evidence of complex 
spreads (2), while adding new information by 
showing that one group (the one that gave rise 
to the PN cluster) was eventually much more 
demographically successful than the others. 

Although Prettejohn’s Gully may represent a 
limited trickle of herders into the south-central 
Rift Valley, numerous PN sites after ~3300 B.P. 
attest to successful specialized pastoralism. Ar- 
chaeologists attribute this florescence to herder 
innovations that allowed them to overcome en- 
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vironmental and disease barriers, likely facilitated 
by strong social networks reflected in widespread 
material cultural traditions (8-17). The dense 
cluster of PN individuals in our PCA—including 
burials >450 km apart—suggests that these 
networks formed among people with shared 
ancestry, with the close relatedness perhaps 
reinforced by ongoing mobility and gene flow. 
Moreover, it is notable that individuals in our 
sample buried with distinctive Elmenteitan ma- 
terial culture display minimal genetic differen- 
tiation from those of other PN burials. Strong 
Elmenteitan material cultural traditions may 
reflect maintenance of social boundaries, but 
our results do not support the view that these 
people were genetically distinct (72, 78). In 
comparison to present-day groups, all PN in- 
dividuals (associated with both the SPN and 
Elmenteitan material cultures) show the greatest 
genetic affinity to Afro-Asiatic speakers, support- 
ing the hypothesis that the initial large-scale ex- 
pansion of pastoralism in eastern Africa was linked 
to the spread of Afro-Asiatic languages (18, 19). 

With regard to the moving frontier model, we 
find that although sampled PN individuals carry 
~20% admixture from local forager groups, al- 
most all of this gene flow occurred well before 
the core PN era, as herders entered new environ- 
ments. By contrast, the rapid spread of pastor- 
alists into Kenya and Tanzania after ~3300 B.P. 
involved minimal gene flow between herders and 
foragers, plausibly due to the formation of a static 
frontier along which social barriers prevented 
large-scale gene flow, despite possible social and 
economic interaction (8, 15). Alternatively, pop- 
ulation densities of foragers may have been so 
low that gene flow between the groups had little 
demographic impact on the more numerous pas- 
toralists (72). Static frontiers were not absolute, 
however, in agreement with ethnographic and 
ethnohistoric records that testify to some inter- 
marriage between foragers and food producers 
(43, 44). Today, for example, the Eyasi Basin is an 
important interaction zone for diverse foraging 
and food-producing groups (44) and is home to 
speakers of each of the four main African lan- 
guage phyla. In our data, the ancestries of the 
individual buried at Luxmanda (22), the south- 
ernmost known PN site (77), and two newly re- 
ported individuals from Gishimangeda Cave in 
the Eyasi Basin attest to additional admixture 
with foragers beyond the events contributing to 
the possible early pastoralists from Prettejohn’s 
Gully and to the main PN cluster. Furthermore, 
at Gishimangeda Cave, we observe three indi- 
viduals clustering genetically with foragers, which 
may reflect social ties between people with dif- 
ferent ancestry and/or ways of life. However, 
given that the three forager-related individuals 
produced insufficient collagen for radiocarbon 
dating and genetic data with substantially lower 
coverage than that of the pastoralist-related 
individuals, we speculate that the observation of 
distinct ancestry types is more likely to be a 
consequence of multiple burial phases at the 
site (Le., greater antiquity for the forager-related 
individuals). 
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Low frequency of genetic adaptation to 
milk consumption 

To test whether the success of PN groups in 
eastern Africa was aided by genetic adaptations 
linked to diet, we also evaluated the sequenced 
individuals for presence or absence of genetic 
variants associated with adult lactase persistence 
(LP) (table S12). Although our coverage is limited 
for some individuals and some SNPs, we observe 
only one instance of an LP-conferring mutation, 
in individual 113762, from Gishimangeda Cave in 
Tanzania. This individual, who falls within the 
main PN genetic cluster and lived during the 
later PN (2150 to 2020 B.P.), carried the derived 
allele at the rs145946881 (G/C-14010) SNP, which 
is the most common LP mutation found among 
eastern African groups today. The other ancient 
individuals could possibly have carried differ- 
ent variants conferring the same phenotype, but 
the assayed SNPs are found at high frequencies 
in some present-day eastern African groups and 
thus are likely to have been important historically 
(45). This finding suggests that eastern African 
pastoralists were mostly lactose intolerant as 
recently as 3000 to 1000 years ago and that the 
LP alleles only recently rose in frequency, al- 
though our results also demonstrate that the 
G/C-14010 mutation was present and could have 
been a target for natural selection by the PN 
period. Direct evidence for dairying is currently 
lacking in the region, despite the specialized 
pastoralist lifestyle inferred from faunal remains 
at PN sites (8). However, culinary innovations 
such as fermentation could have enabled dairy 
consumption even in the absence of LP. 


Increasing complexity in the Iron Age 


The eastern African Iron Age can be summarized 
archaeologically as a mosaic in which foragers, 
herders, and early farmers with distinct tradi- 
tions and ways of life overlapped in space and 
time (2, 14, 15, 19). This complexity is reflected in 
the ancient individuals we analyzed. The young 
boy buried at Deloraine Farm—the site with the 
earliest direct evidence of farming in the Rift 
Valley (32)—shows affinity to western Africans 
and speakers of Bantu languages (both genome- 
wide and on the Y chromosome). This is the ear- 
liest documentation of western African-related 
ancestry in eastern Africa, in a region where 
today such ancestry is widespread and the ma- 
jority of people speak Bantu languages (46). 
The Deloraine Farm child’s genetic distinctive- 
ness as compared with the PN cluster is notable 
in light of similarities in artifacts between El- 
menteitan sites and Deloraine Farm, viewed as 
evidence of continuity from the Elmenteitan to 
the Iron Age (32, 47). By contrast, four PIA in- 
dividuals spanning an ~800-year period show 
greater affinity to present-day Nilotic speakers 
and are associated with an influx of Sudan 
(Dinka)-related ancestry. Similarities between 
archaeologically and ethnographically docu- 
mented material culture suggest that PIA sites 
may be associated with ancestors of present-day 
Kenyan Nilotic speakers such as the Kalenjin or 
Maasai (32, 47). Both the PIA individuals and 
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present-day Maasai retain substantial compo- 
nents of PN-related ancestry, showing that the 
ancestry composition of PIA and more recent 
pastoralists reflects mixture with previously es- 
tablished herder groups in eastern Africa. For 
other groups, such as Luhya (who speak a Bantu 
language), there is little evidence of PN-related 
ancestry, suggesting that their ancestors spread 
into Kenya without mixing substantially with 
local herders. Boundaries between foragers and 
food producers may have been maintained during 
the Iron Age, as we do not observe a significant 
increase in forager ancestry in the PIA or IA in- 
dividuals, but we cannot rule out a small pro- 
portion of additional forager-associated admixture. 
Overall, we caution that these interpretations 
are limited by small sample sizes and may not 
reflect the more nuanced regional dynamics 
within this mosaic. 


Genetic diversity of eastern 
African foragers 


Archaeological evidence of foragers across Holo- 
cene eastern Africa encompasses an array of 
material culture and subsistence traditions (48). 
This study adds to our understanding of LSA 
genetic variation by reporting ancient DNA from 
additional foragers without pastoralist-related 
admixture, including from fisher-foragers near 
Lake Victoria who may have been living con- 
temporaneously with PN herders in the broader 
region. These individuals fall in an intermediate 
position between Ethiopian and Tanzanian for- 
agers on a genetic cline that is well correlated 
(among sampled ancient individuals) with geo- 
graphical location (22). Broadly, however, the 
similarity of foragers buried in the Victoria and 
Eyasi basins to individuals living on the Kenya 
coast and in Ethiopia and coastal Tanzania (22, 24) 
suggests that shared forager ancestry extended 
widely across the region, as also attested by 
present-day genetic data (20). 


Outlook 


Genome-wide data from 41 ancient eastern 
Africans show that archaeological complexity 
during the spreads of herding and farming is 
also reflected in genetic patterns, which indicate 
multiple movements of and gene flow among 
ancestrally distinct groups of people. We iden- 
tify three components of ancestry harbored by 
ancient pastoralists and propose a sequence of 
admixture events to explain our observations; 
future archaeological and ancient DNA research 
in the Turkana Basin, the Horn of Africa, and 
other parts of northeastern Africa will be nec- 
essary to confirm the earliest stages of the 
spread of herding into the region. At the other 
end of our timeframe, we document multiple 
admixture events affecting Iron Age groups asso- 
ciated with heterogeneous economic, cultural, 
and linguistic patterns. This complexity can be 
further explored through additional comparisons 
of genetic and archaeological diversity. Ancient 
DNA offers a new source of information about 
eastern African Holocene prehistory, and an im- 
portant next direction is to integrate this infor- 
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mation rigorously with insights provided by the 
longer-established disciplines of archaeology and 
linguistics. 


Materials and methods summary 


Human skeletal remains from eastern African ar- 
chaeological sites, including the petrous portion 
of the skull, teeth, and other bones, were sam- 
pled from the National Museums of Kenya and 
Tanzania and the Livingstone Museum in Zambia, 
following protocols to minimize both destruc- 
tion and contamination. Bioarchaeological data 
on the analyzed individuals, along with detailed 
information about archaeological context, are 
provided in the full materials and methods (27). 
DNA was extracted from bone powder in dedi- 
cated clean rooms at Harvard Medical School 
using protocols optimized for ancient DNA. 
Illumina sequencing libraries were prepared 
with uracil-DNA-glycosylase (UDG) to reduce 
deamination-induced errors. Before sequencing, 
libraries were enriched for molecules overlap- 
ping ~1.2 million genome-wide SNPs. Of the 77 
samples processed for this study, 43 (from 41 
distinct individuals) provided genome-wide an- 
cient DNA data. Direct radiocarbon dates were 
generated at the Pennsylvania State University 
(PSU) Radiocarbon Laboratory via accelerator 
mass spectrometry (AMS). 

Raw sequencing data were filtered and aligned 
to the human reference genome. One sequence 
per individual was chosen randomly from those 
overlapping each targeted SNP to represent that 
individual at that position. On the basis of au- 
thenticity metrics, two individuals were excluded 
from genome-wide analyses. The other 39 indi- 
viduals were analyzed in conjunction with pub- 
lished genetic data from ancient individuals and 
present-day groups, using a variety of statistical 
approaches. Multiple population genetics meth- 
ods were applied to investigate proportions, 
sources, and dates of admixture, with a partic- 
ular emphasis on testing of proposed admixture 
models through the gpAdm software. 
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PROTEIN STABILITY 


A glycine-specific N-degron pathway 
mediates the quality control of 
protein \V-myristoylation 


Richard T. Timms, Zhiqian Zhang, David Y. Rhee, J. Wade Harper, 


Itay Koren*, Stephen J. Elledge* 


INTRODUCTION: The ubiquitin-proteasome 
system is the major route through which the 
cell achieves selective protein degradation. The 
E3 ubiquitin ligases are the major determinants 
of specificity in this system, which is thought to 
be achieved through their selective recognition 
of specific degron motifs in substrate proteins. 
However, our ability to identify these degrons 
and match them to their cognate E3 ligase 
remains a major challenge. 


RATIONALE: It has long been known that 
the stability of proteins is influenced by their 


N-terminal residue, and a large body of work 
over the past three decades has characterized 
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a collection of N-end rule pathways that target 
proteins for degradation through N-terminal 
degron motifs. Recently, we developed Global 
Protein Stability (GPS)-peptidome technology 
and used it to delineate a suite of degrons that 
lie at the extreme C terminus of proteins. We 
adapted this approach to examine the stabil- 
ity of the human N terminome, allowing us to 
reevaluate our understanding of N-degron path- 
ways in an unbiased manner. 


RESULTS: Stability profiling of the human 
N terminome identified two major findings: 
an expanded repertoire for UBR family E3 
ligases to include substrates that begin with 
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The glycine N-degron pathway. Stability profiling of the human N-terminome revealed that 
N-terminal glycine acts as a potent degron. CRISPR screening revealed two Cul2 complexes, 
defined by the related substrate adaptors ZYG11B and ZERI1, that recognize N-terminal glycine 
degrons. This pathway may be particularly important for the degradation of caspase cleavage 
products during apoptosis and the removal of proteins that fail to undergo N-myristoylation. 
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arginine and lysine following an intact initia- 
tor methionine and, more notably, that glycine 
positioned at the extreme N terminus can act 
as a potent degron. We established human em- 
bryonic kidney 293T reporter cell lines in which 
unstable peptides that bear N-terminal glycine 
degrons were fused to green fluorescent pro- 
tein, and we performed CRISPR screens to 
identify the degradative machinery involved. 
These screens identified two Cul2 Cullin-RING 

E3 ligase complexes, de- 
fined by the related sub- 
Read the full article Strate adaptors ZYG1IB 
at http://dx.doi. and ZERI, that act redun- 
org/10.1126/ dantly to target substrates 
science.aaw4912 bearing N-terminal glycine 
scat ute tees meant catalan destons for proteasonial 
degradation. Moreover, through the saturation 
mutagenesis of example substrates, we defined 
the composition of preferred N-terminal gly- 
cine degrons specifically recognized by ZYG11B 
and ZERI1. 

We found that preferred glycine degrons are 
depleted from the native N termini of meta- 
zoan proteomes, suggesting that proteins have 
evolved to avoid degradation through this 
pathway, but are strongly enriched at anno- 
tated caspase cleavage sites. Stability profil- 
ing of N-terminal peptides lying downstream 
of all known caspase cleavages sites con- 
firmed that Cul27%°"8 and Cul2““™! could 
make a substantial contribution to the removal 
of proteolytic cleavage products during ap- 
optosis. Last, we identified a role for ZYG11B and 
ZER1 in the quality control of N-myristoylated 
proteins. N-myristoylation is an important post- 
translational modification that occurs exclu- 
sively on N-terminal glycine. By profiling the 
stability of the human N-terminome in the 
absence of the N-myristoyltransferases NMT1 
and NMT2, we found that a failure to un- 
dergo N-myristoylation exposes N-terminal 
glycine degrons that are otherwise obscured. 
Thus, conditional exposure of glycine degrons 
to ZYG11B and ZER1 permits the selective 
proteasomal degradation of aberrant proteins 
that have escaped N-terminal myristoylation. 


CONCLUSION: These data demonstrate that 
an additional N-degron pathway centered on 
N-terminal glycine regulates the stability of 
metazoan proteomes. cula7"®_ and Cul27"*- 
mediated protein degradation through N- 
terminal glycine degrons may be particularly 
important in the clearance of proteolytic frag- 
ments generated by caspase cleavage during 
apoptosis and in the quality control of protein 
N-myristoylation. 
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A glycine-specific N-degron pathway 
mediates the quality control of 
protein \-myristoylation 


Richard T. Timms”, Zhiqian Zhang””, David Y. Rhee®, J. Wade Harper*, 


Itay Koren”?*+, Stephen J. Elledge”?* 


The N-terminal residue influences protein stability through N-degron pathways. We used 
stability profiling of the human N-terminome to uncover multiple additional features of 
N-degron pathways. In addition to uncovering extended specificities of UBR E3 ligases, we 
characterized two related Cullin-RING E3 ligase complexes, Cul27’¢"8 and Cul27£®!, that 
act redundantly to target N-terminal glycine. N-terminal glycine degrons are depleted 

at native N-termini but strongly enriched at caspase cleavage sites, suggesting roles for 
the substrate adaptors ZYG11B and ZER1 in protein degradation during apoptosis. 
Furthermore, ZYG11B and ZER1 were found to participate in the quality control 

of N-myristoylated proteins, in which N-terminal glycine degrons are conditionally exposed 
after a failure of N-myristoylation. Thus, an additional N-degron pathway specific for 
glycine regulates the stability of metazoan proteomes. 


he ubiquitin-proteasome system (UPS) is 

the major route through which eukaryotic 

cells achieve selective protein degradation 

(1). The specificity of this system is pro- 

vided by E3 ubiquitin ligases, of which 
more than 600 are encoded in the human ge- 
nome. E3 ligases recognize specific sequence 
elements, known as degrons, that are present 
in substrate proteins (2). However, although 
a detailed knowledge of the specificity of E3 
ligases for degrons will be essential for achiev- 
ing a systems-level understanding of the UPS, 
our current knowledge of degron motifs remains 
remarkably sparse (3). 

The first degrons to be discovered were located 
at the N terminus of proteins (4). N-terminal 
degrons are targeted by N-degron pathways 
[formerly known as N-end rule pathways (5)], 
of which there are two main branches: the 
Arg/N-degron pathway, through which UBR- 
family E3 ligases target N-termini typically 
generated through endoproteolytic cleavage 
(6, 7), and the Ac/N-degron pathway, through 
which proteins bearing acetylated N-termini 
are targeted for degradation by the E3 ligase 
MARCH6 (also known as TEB4) (8, 9). In ad- 
dition, a Pro/N-degron pathway was recently 
described, through which proteins harboring 
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an N-terminal proline residue are degraded by 
the GID E3 ligase complex (fig. S1) (0). Theo- 
retically, these pathways have the capacity to 
target the majority of cellular proteins, but the 
extent to which they affect protein stability in a 
physiological context remains unclear. For ex- 
ample, loss of N-terminal acetyltransferase (NAT) 
enzymes has minimal effect on protein stability 
in yeast (17), which is inconsistent with a wide- 
spread role for the Ac/N-degron pathway. 

Previously, we modified the Global Protein 
Stability (GPS) system (72) to develop a high- 
throughput method to characterize degron motifs 
in human proteins (73). This approach is based on 
a lentiviral expression vector that encodes two 
fluorescent proteins: DsRed, which serves as an 
internal reference, and green fluorescent protein 
(GFP) fused to a short peptide of interest, which is 
translated from an internal ribosome entry site 
(IRES). Because both DsRed and the GFP-peptide 
fusion protein are expressed from the same 
transcript, the GFP/DsRed ratio can be used to 
quantify the effect of the peptide sequence on the 
stability of GFP (13). We exploited the ubiquitin- 
fusion technique (4) to adapt this “GPS-peptidome” 
approach to search for N-terminal degron mo- 
tifs. We thereby directly examined the contribu- 
tion of N-terminal sequences to protein stability 
in human cells. 


Stability profiling of the human 
N-terminome using GPS-peptidome 
technology 


We synthesized an oligonucleotide library en- 
coding the first 24 amino acids of the primary 
isoform(s) of all human proteins, both with and 
without an initiator methionine (~50,000 


sequences). These were cloned into the “Ub-GPS” 
expression vector between the ubiquitin gene 
and GFP (Fig. 1A). Upon expression of the con- 
structs in human embryonic kidney (HEK) 293T 
cells, proteolytic cleavage of the ubiquitin moiety 
by endogenous deubiquitinating enzymes led to 
the exposure of the peptides at the N terminus of 
GFP (Fig. 1A). We used fluorescence-activated cell 
sorting (FACS) to partition the population into six 
bins of equal size on the basis of the stability of the 
peptide-GFP fusion. The stability of each fusion 
was then quantified with Illumina sequencing, 
with each peptide assigned a protein stability 
index (PSI) score ranging between 1 (maximally 
unstable) and 6 (maximally stable) according to 
the proportion of sequencing reads in each bin 
(data file S1). 

We began by assessing the effect of the ini- 
tiator methionine on protein stability. Overall, 
peptide-GFP fusions that lacked an initiator 
methionine were much less stable than their 
counterparts with an initiator methionine (Fig. 
1B). However, this effect was only observed for 
certain N-terminal residues (Fig. 1C). Reporters 
that begin with amino acids bearing small side 
chains (C, V, G, P, T, A, and S) were generally 
relatively stable and exhibited little or no dif- 
ference in overall stability, whether or not they 
were preceded by an upstream methionine resi- 
due. (Single-letter abbreviations for the amino 
acid residues are as follows: A, Ala; C, Cys; D, 
Asp; E, Glu; F, Phe; G, Gly; H, His; I, Ie; K, Lys; 
L, Leu; M, Met; N, Asn; P, Pro; Q, Gln; R, Arg; S, 
Ser; T, Thr; V, Val; W, Trp; and Y, Tyr.) This is 
consistent with efficient cleavage of the initiator 
methionine by methionine aminopeptidases when 
the following amino acid has a sufficiently small 
radius of gyration (74). By contrast, peptide-GFP 
fusions that begin with all other residues (except 
methionine itself) were generally stable only when 
preceded by an upstream methionine residue 
and were greatly destabilized in the absence of 
an initiator methionine (Fig. 1, C to F). 

Overall, these data provide strong support for 
a central role of the Arg/N-degron pathway in 
protein quality control. Whereas proteins bearing 
native N-termini [methionine itself, or C/V/G/P/ 
T/A/S, from which methionine is normally re- 
moved (/4)] are broadly stable, proteins bearing 
aberrant N-termini (R/K/H/W/Y/F/L/1/D/E/N/ 
Q, without a preceding methionine) are all highly 
unstable. The latter residues correspond perfect- 
ly to the primary type I (R/K/H), primary type II 
(W/Y/F/L/D, secondary (D/E), and tertiary (N/Q) 
N-terminal degrons of the Arg/N-degron path- 
way (fig. S1A). Crucially, however, when these 
residues were preceded by methionine—as they 
would be in the context of normal protein synthesis— 
broad stabilization was observed (Fig. 1F). 


Computational identification of 
destabilizing N-terminal motifs 


Subsequently, we focused on understanding the 
factors that determined the stability of peptide- 
GFP fusions synthesized with an initiator methi- 
onine. Stability scores for these fusions were 
distributed bimodally, with approximately one-third 
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Fig. 1. GPS profiling of the human N-terminome. (A) Schematic 
representation of the N-terminome GPS screen, in which the 

first 24 residues of all human proteins were expressed in the Ub-GPS 
vector as N-terminal fusions to GFP. (B) Distribution of protein 
stability scores observed from the screen depicted in (A). (©) Boxplots 
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with the indicated amino acid, when encoded either with (blue 

boxes) or without (orange boxes) an upstream methionine residue. 

(D to F) Heatmaps depicting the mean stability score for all peptides 
that begin with the indicated two amino acids, when encoded either with 
(E) or without (D) an upstream methionine residue; (F) illustrates the 


showing the distribution of stability scores for all peptides that begin 


of the library exhibiting significant instability 
(Fig. 1B, blue histogram). One key factor that 
strongly influenced stability was amino acid 
composition (Fig. 2, A and B). For example, 
aspartic acid and glutamic acid were depleted 
from unstable peptides and enriched among the 
stable peptides, whereas hydrophobic residues 
such as tryptophan, phenylalanine, isoleucine, and 
leucine showed the opposite pattern. This effect 
is not specific to the N terminus, however, be- 
cause similar rules govern the stability of re- 
porter constructs in which peptides are fused at 
the C terminus of GFP (73). 

Most amino acids exerted a similar effect on 
stability regardless of their position across the 
24-amino acid peptide, but we noticed that 
certain residues exerted differing effects spe- 
cifically when encoded at the second position 
(Fig. 2, A and B). We therefore performed a 
computational analysis to identify motifs that 
might promote instability specifically when lo- 
cated at or near the N terminus of the peptide. 
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For all possible combinations of dipeptide motifs, 
we compared the mean stability of all peptide- 
GFP fusions harboring the motif within the first 
seven N-terminal amino acids with those harbor- 
ing the motif at an internal position in the 
24-amino acid peptide (Fig. 2C and data file 
$2). Over 80% of the top 100 candidate destab- 
ilizing N-terminal motifs could be grouped into 
one of four categories solely based on the identity 
of the second residue: Lysine was present down- 
stream of the initiator methionine in 26 motifs, 
arginine in 24 motifs, glycine in 22 motifs, and 
cysteine in nine motifs (Fig. 2D). Reporters that 
encode these residues at the second position were 
significantly less stable than reporters that contain 
these residues at any internal position (Fig. 2E), and 
globally, peptide-GFP fusions that begin MC-, MR-, 
MG-, and MK- exhibited the lowest mean stability 
(Fig. 2F). Thus, considering initiator methionine 
removal, this analysis identified N-terminal glycine 
and cysteine in addition to MR- and MK- as can- 
didate destabilizing N-terminal motifs. 


difference between the two. 


Exploring the substrate repertoire of 
UBR family E3 ligases 

Next, we sought to identify the cellular machin- 
ery targeting each class of putative N-terminal 
degron. We began by investigating a role for UBR 
family E3 ligases. UBR1, UBR2, and UBR4: have 
been shown functionally to participate in the 
recognition of N-degrons (75), and so, through 
sequential rounds of CRISPR/Cas9-mediated gene 
disruption, we attempted to create a single-cell 
clone that lacks all three of these UBR proteins. 
Despite screening ~40 clones, we were unable 
to identify a clone in which simultaneous ab- 
lation of UBR1, UBR2, and UBR4: proteins was 
observed with immunoblot, suggesting that such 
a triple-mutant cell may not be viable. However, 
we were able to generate clones that express 
substantially reduced levels of two or more of the 
proteins (Fig. 3A). Ub-GPS reporters in which the 
initiator methionine of GFP was replaced with 
either arginine (R), lysine (K), or tyrosine (Y) 
were strongly destabilized in wild-type cells, but 
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Fig. 2. Identification of degron motifs 
located at protein N-termini. (A and B) 
The effect of peptide composition on 
protein stability. Shown are heatmaps 
depicting the relative depletion (blue) or 
enrichment (red) of each amino acid 
across all positions of the 24-amino 

acid peptide among (A) unstable 
peptides versus (B) stable peptides. 

(C to F) Computational prediction 

of N-terminal degrons. (C) For all possible 
combinations of dipeptide motifs, the 
mean difference in stability between 
peptides that contain the motif at 

the extreme N terminus (that is, 
immediately following the initiator 
methionine) was compared with 

all peptides that contain the motif 

at any other internal position in the 
peptide. (D) Classes of N-terminal 
degrons. The majority of the top 

100 predicted destabilizing N-terminal 
motifs encoded either glycine (G2), lysine 
(K2), arginine (R2), or cysteine (C2) at 
the second position; some example 
motifs are annotated. (E) Boxplots 
showing the distribution of stability 
scores for all peptides in which the 
indicated residues were encoded at the 
second position (colored boxes) versus 
any other internal position within the 
peptide (gray boxes). (F) Boxplots 
showing the distribution of stability 
scores for all peptides with the indicated 
residues encoded at the second position. 
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this effect was abrogated in UBR knockout (KO) 
clone 1 and clone 3 (fig. S2A) and completely 
abolished in clone 2 (Fig. 3B). 

We created a panel of Ub-GPS constructs 
in which either 23-amino acid peptides (Fig. 
2C) or 3-amino acid peptides (fig. S2B) that 
harbor example degron motifs downstream of 
an initiator methionine were fused to the N 
terminus of GFP. In both cases, loss of UBR 
proteins resulted in the stabilization of report- 
ers bearing three of the classes of degrons 
motifs: MK-, MR-, and N-terminal cysteine. 
However, loss of UBR proteins had little or no 
effect on the stability of the GFP-fusion pro- 
teins bearing N-terminal glycine, suggesting a 
role for additional E3 ligase(s) in the recog- 
nition of this particular N-terminal degron. 

It was not surprising that UBR E3 ligases 
targeted N-terminal cysteine, given that nitric 
oxide-mediated oxidation and subsequent argi- 
nylation of N-terminal cysteine renders it a sub- 
strate for the Arg/N-degron pathway (16). That 
said, ATE1 disruption only led to modest stabi- 
lization of two peptide-GFP substrates that ex- 
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suggesting that additional routes to UBR-mediated 
degradation must also exist. UBR-mediated degra- 
dation of proteins that begin MK- and MR- was 
unexpected, however, suggesting that in addition 
to targeting truncated proteins bearing abnormal 
N-termini, UBR ligases might also target certain 
intact proteins that bear their initiator methionine. 
To confirm that the initiator methionine of these 
substrates was intact, and thus rule out the 
possibility that methionine removal was instead 
exposing canonical Arg/N-degrons, we used mass 
spectrometry to examine the N terminus of two 
example peptide-GFP UBR substrates expressed 
in UBR KO clone 2 (fig. S3A). In both cases, we 
were readily able to detect the intact N-terminal 
peptide with the initiator methionine present, 
whereas we could not detect any peptides corre- 
sponding to a putative processed form without 
an initiator methionine (fig. S3B). 

To further examine this property of UBR pro- 
teins, we directly compared the stability of the 
entire Ub-GPS N-terminome library in wild-type 
cells versus UBR KO clones 1, 2, and 3 (Fig. 3D 
and data file S3A). Loss of UBR proteins had little 
effect on the overall stability of reporters syn- 
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thesized with an N-terminal methionine; only 
570 peptide-GFP fusion proteins (<3% of the 
N-terminome library) exhibited substantial sta- 
bilization (>0.8 PSI units) in any of the UBR mu- 
tant clones compared with control cells (Fig. 3E). 
Sequence analysis of the UBR substrates revealed 
aclear preference for particular N-terminal degron 
motifs (Fig. 3F and fig. $4, A to H). Consistent 
with our previous data (Fig. 2C and fig. S2B), 
peptides starting MC-, MK-, and MR- were all 
enriched. Peptides starting ML- and MI- were 
also overrepresented, and for three example 
peptides in each case, we validated that they 
were stabilized in UBR KO clone 2 (fig. S41). 
In Saccharomyces cerevisiae, Ubr1 has been 
shown to target proteins that start M®- (where 
® is a bulky hydrophobic residue, W/F/Y/L/]l) 
for degradation (17); however, unlike peptides 
that start ML- and MI-, we did not observe en- 
richment for peptides that start MF- or MY- among 
the UBR substrates, and observed only weak en- 
richment for peptides that start MW-. 

Last, only a small proportion of all peptides in 
the library that begin MK-, MR-, ML-, or MI- 
were UBR substrates, suggesting that additional 
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Fig. 3. Assessing the repertoire of UBR substrates among the 
human N-terminome. (A to C) Assessing UBR-mediated degradation 
through N-terminal degron motifs. (A) CRISPR/Cas9-mediated 
generation of clones expressing reduced levels of UBR1, UBR2, and 
UBR4. (B) Functional validation of UBR KO clones. Optimal Arg/N-end 
rule substrates were highly unstable in wild-type cells but not in 

UBR KO clone 2, as measured with flow cytometry (fig. S2A). (C) UBR 
proteins target example peptide-GFP reporters in which lysine, arginine, 
or cysteine, but not glycine, are encoded at the second position. 
N-terminal peptides derived from the indicated genes were expressed in 
wild-type or UBR KO clone 2 by using the Ub-GPS system, and their 
stability was assessed by means of flow cytometry. (D to F) Global 
identification of N-terminal UBR substrates. (D) Schematic representa- 


tion of the GPS screen. (E) Venn diagram summarizing the substrates 
stabilized >0.8 PSI units across the three UBR KO clones. (F) Heatmap 
showing the relative enrichment (red) or depletion (blue) of each 
amino acid across all positions of the 24-amino acid peptide comparing 
peptides stabilized in two or more of the UBR KO clones relative 

to the whole N-terminome library (fig. S4). (@ to I) Characterization of 
N-terminal UBR degrons through saturation mutagenesis. Each of 

the first 10 residues of the N-terminal peptides derived from (G) 
ZNF334 (beginning MK-), (H) AHRR (beginning MR-), and (1) CDX1 
(beginning MC-) were mutated to all other possible residues, and their 
stabilities were measured by means of FACS and Illumina sequencing. 
The darker the color, the greater the degree of stabilization as 
compared with that of the wild-type sequence (fig. S5). 


residues were essential for degron recognition. 
Analysis of the composition of all the UBR sub- 
strates identified in each category highlighted 
preferred residues enriched at downstream po- 
sitions (fig. S4, E to H). Furthermore, for some 
example peptides that start MK-, MR-, and MC-, 
we defined the N-terminal UBR degron in detail 
by performing saturation mutagenesis experi- 
ments. We created a Ub-GPS library in which 
each of the residues from position 2 to position 
10 of the 24-amino acid peptide were mutated 
to all other possible amino acids and measured 
the stability of the resulting peptide-GFP fu- 
sions by means of FACS and Illumina se- 
quencing (data file S4A). These experiments 
confirmed the critical importance of the lysine, 
arginine, or cysteine residue encoded at the sec- 


Timms et al., Science 365, eaaw4912 (2019) 


5 July 2019 


ond position but also demonstrated that certain 
mutations at the third or fourth position could 
prevent degron recognition (Fig. 3, G to I, and 
fig. S5). These data also confirmed the require- 
ment for these degron motifs to be positioned at 
the extreme N terminus because addition of just 
a single upstream amino acid (that is, immedi- 
ately after the initiator methionine) resulted in 
stabilization of the peptide-GFP fusions (Fig. 3, 
G to I, and fig. S5, column “add”). 


N-terminal glycine can act as a 
potent degron 


We next focused on the one class of N-terminal 
degron motif that was not a substrate for UBR- 
mediated degradation: N-terminal glycine. To 
validate that N-terminal glycine did indeed con- 


stitute a degron motif, we performed a series 
of mutagenesis experiments on a panel of un- 
stable Ub-GPS reporters in which 24-amino 
acid peptides starting MG- were fused to the 
N terminus of GFP (Fig. 4A and fig. S6A). In 
each case, the glycine residue was indeed crit- 
ical for instability because a single substitution 
converting the glycine residue to serine (G2S) 
was sufficient to inhibit degradation (Fig. 4A 
and fig. S6A, left). Moreover, the position of the 
glycine residue at the extreme N terminus was 
also critical because addition of a single serine 
residue upstream of the glycine (add S) stabi- 
lized the peptide-GFP fusions to a similar extent 
(Fig. 4A and fig. S6A, center). Last, consistent 
with the notion that the initiator methionine is 
constitutively cleaved when followed by a small 
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residue such as glycine, deletion of the initiator 
methionine (AMet) had no stabilizing effect on 
any of the peptide-GFP fusions (Fig. 4A and fig. 
S6A, right). 

For some example peptides, we defined the 
N-terminal glycine degron in detail by perform- 
ing saturation mutagenesis experiments (data 
file S4A). This confirmed the absolute require- 
ment for the exposure of glycine at the extreme 
N terminus because addition of any single 
amino acid upstream of the glycine resulted in 
stabilization of the peptide-GFP fusion (Fig. 4B 
and fig. S6, B to G, column “add”). The size of the 
degron motif appeared to be relatively small, but 
some substitutions at the residues immediately 
downstream of the exposed glycine did exert a 


stabilizing effect (Fig. 4B and fig. S6, B to G). 


Cul22¥48 and Cul275®! target 
N-terminal glycine 

We began the search for the E3 ligase(s) re- 
sponsible for targeting N-terminal glycine by 
using the small molecule MLN4924. MLN4924: 
acts as a broad inhibitor of Cullin-RING ligases 
(CRLs) by blocking Cullin neddylation (78), thus 
allowing us to narrow the search to either CRL 
or non-CRL ligase families. All our example Ub- 
GPS constructs bearing N-terminal glycine were 
stabilized upon treatment with MLN4924, im- 
plicating CRLs in the recognition of N-terminal 
glycine (Fig. 4C and fig. S7A). 

Next, we sought to identify the specific CRL 
adaptor(s) responsible for recognition of the 
N-terminal glycine degron. Using dominant- 
negative constructs to inhibit each of the major 


Cullins, we determined that either Cul2 or Cul5 
was responsible for the degradation of example 
Ub-GPS reporters harboring N-terminal glycine 
degrons (Fig. 4D and fig. S7B). Using these 
reporter substrates, we performed a series of 
CRISPR/Cas9-mediated genetic screens using a 
library of single-guide RNAs (sgRNAs) targeting 
known CRL2/5 substrate adaptor proteins (fig. 
S8A). Together, these screens identified ZYG11B 
as the CRL2 substrate adaptor responsible for 
recognition of the N-terminal glycine degron 
motif (Fig. 4E, fig. S8B, and data file S5). In- 
triguingly, ZER1, which is closely related to 
ZYGI11B (29% amino acid identity) (fig. S8C), 
was enriched at or approaching the level of 
statistical significance in several screens, sug- 
gesting that these two related adaptors may 
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Fig. 4. Cul27¥S"8 and Cul27£®! target N-terminal glycine. (A) Glycine 
at the N terminus can act as a potent degron. The N-terminal peptide 
derived from SNX11 and a mutant version that lacked the initiator 
methionine (AMet) were highly unstable, whereas mutant versions in which 
the terminal glycine was mutated to serine (G2S) or in which a serine 
residue was added between the initiator methionine and the glycine 
residue (add S) were not (fig. S6A). (B) Defining N-terminal glycine 
degrons through saturation mutagenesis. Each of the first 10 residues of 
the SNX11 peptide were mutated to all other possible residues, and 

their stabilities were measured by means of FACS and Illumina sequencing. 
The darker the color, the greater the degree of stabilization as compared 
with that of the wild-type sequence (fig. S6B). (© and D) Cul2 complexes 
target N-terminal glycine. Stabilization of the SNX11-GFP reporter (C) upon 
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treatment with the CRL inhibitor MLN4924, and (D) after expression of 
dominant-negative (DN) versions of Cullins (fig. S7). (E and F) CRISPR 
screens identify the Cul2 substrate adaptors responsible for the 
recognition of N-terminal glycine. (E) Results of the SNX11-GFP reporter 
screen, which highlighted two CRL2 complexes (F) (fig. S8). (G to J) 
Cul27’CH8 and Cul27*®! cooperate to target N-terminal glycine. (G) CRISPR- 
mediated ablation of both ZYG11B and ZERI1 was required for full 
stabilization of the SNX11-GFP reporter. (H) Exogenous expression of either 
ZYGI1I1B or ZER1 rescued degradation of the SNX11-GFP reporter in cells 
lacking endogenous ZYG11B and ZERI1. Knockout of ZYG11B and 

ZER1 stabilized full-length SNX11 fused to the N terminus of GFP, both (1) 
when expressed in the context of the Ub-GPS system or (J) without 
upstream ubiquitin fusion (fig. S9). 
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collaborate in the degradation of proteins that 
expose N-terminal glycine (Fig. 4F). The third 
member of the ZYG11 family, ZYG11A, did not 
score in any of the screens, which is consistent 
with RNA-sequencing data (19) showing that 
it is rarely expressed across human tissues (fig. 
S8, C and D). 

To examine the possibility of cooperation be- 
tween ZYGI11B and ZERI, we performed indi- 
vidual CRISPR/Cas9-mediated gene disruption 
experiments, ablating the function of ZYG11B or 
ZER1 either alone or in combination. Loss of 
ZYG11B alone did indeed stabilize all of the 
peptide-GFP fusion proteins (Fig. 4G and fig. 
S9A), but whereas complete stabilization was 
observed for two of the reporters, only partial 
stabilization was observed for the others. By 
contrast, loss of ZER1 alone had little stabi- 
lizing effect on any of the reporters; however, 
simultaneous disruption of both ZER1 and 
ZYGIIB resulted in complete stabilization (Fig. 
4G and fig. S9A). Furthermore, ZYG11B and 
ZER1 both associated with putative substrates 


that bear N-terminal glycine degrons (fig. S9B), 
and exogenous expression of either ZYG1IB or 
ZER1 alone in ZYGI1B/ZER1 double-mutant cells 
fully restored the degradation of a peptide-GFP 
fusion whose stabilization required ablation of 
both endogenous ZYGIIB and ZERI1 (Fig. 4H). 
Last, we validated that Cul27*°" and Cula7=™! 
were able to mediate the degradation of full- 
length proteins that bear exposed glycine resi- 
dues at their N termini (Fig. 4, I and J, and 
fig. S9, C to E). 

To obtain a global view of the substrates tar- 
geted by these Cul2 complexes, we compared 
the stability of the Ub-GPS N-terminome library 
in wild-type cells with cells that lack either 
ZYGI11B, ZER1, or both ZYG11B and ZERI (fig. 
S10A and data file S3B). First, this revealed 
that ZYG11B and ZER1 share the majority of 
their substrates: There were 115 fusions stabi- 
lized in ZYG11B mutant cells and 36 stabilized 
in ZER1 mutant cells, whereas 488 were stabi- 
lized in the double-mutant cells. Sequence anal- 
ysis of these shared substrates confirmed that 
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N-terminal glycine was the most enriched fea- 
ture while also highlighting preferred (F, G, H, 
K, and Y) and disfavored (D, E, I, P, S, and T) 
residues at the following position (fig. S10B). 
Of the substrates that were targeted solely by 
ZYGI11B, over 90% encoded a glycine residue at 
the second position (fig. S10C). Intriguingly, there 
was no enrichment of N-terminal glycine among 
the substrates exclusively targeted by ZER1 (fig. 
S10D). This finding suggested that (i) any ZER1 
substrates bearing an N-terminal glycine were 
also substrates for ZYG11B, and hence were still 
targeted for degradation in ZER1 mutant cells, 
and (ii) although N-terminal glycine was indis- 
pensable for recognition by ZYG11B, in some 
contexts ZER1 might recognize substrates that 
begin with residues other than glycine. We char- 
acterized one such substrate—the N-terminal 
peptide derived from KCNT2 (which begins 
MPYL)—in detail (fig. S11). In particular, satu- 
ration mutagenesis revealed that the hydrophobic 
residues encoded at the third and fourth posi- 
tion formed a critical part of the ZER1 degron, 
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Fig. 5. N-terminal glycine degrons are depleted from metazoan 
proteomes. (A to D) Defining the degrons recognized by ZYG11B and 
ZERI1 through saturation mutagenesis. Each of the first 10 residues of the 
SNX11 N-terminal peptide were mutated to all possible amino acids; the 
stability of each mutant in the resulting Ub-GPS library was measured in 
(A) wild-type, (B) ZER1 mutant, or (C) ZYG11B mutant cells. The color scale 
reflects the raw PSI measurement for each peptide-GFP fusion, so that the 
greater the intensity of the red color, the greater the stabilizing effect of 
the mutation. The heatmap in (D) illustrates the difference between the PSI 
in ZYG11B mutant cells versus ZER1 mutant cells; thus, a dark red color 
indicates mutations that prevent recognition by ZER1 but not by ZYG11B, 
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(favored for CRL2 degradation) (disfavored for CRL2 degradation) 
whereas a dark blue color indicates mutations that permit recognition by 
ZER1 but not by ZYG11B (fig. S13). (E) Normalized amino acid frequencies 
across the first 10 residues (following the initiator methionine) of human 
proteins. (F and G) Depletion of N-terminal glycine degrons in metazoan 
proteomes. The normalized amino acid frequency of glycine encoded at 
the second position in the indicated proteomes is shown by the blue dots, 
and is further categorized depending on whether the glycine residue is 
followed by a residue favoring (orange dots) or disfavoring (green dots) 
CRL2-mediated degradation. The relationship observed across animal 
proteomes (F) is not apparent across fungal proteomes (G), which do not 
possess a ZYGI1 ortholog. (***P < 0.001, Fisher's exact test) 
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whereas some more flexibility was tolerated at 
the second position (fig. S11G). However, the 
location of these residues relative to the front 
of the peptide remained critical because the 
addition of a single amino acid upstream of the 
proline residue prevented degradation (fig. S11G). 


Defining the N-terminal glycine degrons 
recognized by ZYG11B and ZER1 


To gain further insight into the specific degron 
motifs recognized by ZYG11B and ZER1, we 
examined a larger number of potential peptide- 
GFP substrates that begin with glycine (fig. S12). 
These could be divided into three categories: 
peptides fully stabilized upon mutation of 
ZYGIIB alone (fig. S12A); peptides stabilized 
partially upon mutation of ZYG11B alone, but 
which required combined mutation of ZYG11B 
and ZER1 for complete stabilization (fig. S12B); 
and peptides for which full redundancy was ob- 
served between ZYGIIB and ZERI (fig. S12C). 
For the vast majority of the peptides in the latter 
two categories, an aromatic residue (H, F, or Y) 
was located downstream of the terminal glycine, 
supporting the idea that ZER1 might preferen- 
tially recognize bulky residues located further 
along the peptide chain (fig. $12D). 

We tested this hypothesis more rigorously by 
repeating the saturation mutagenesis expe- 
riments in the genetic background of either 
ZYGI11B ablation or ZER1 ablation (data file 
S4, B and C). The results for some representative 
peptides are shown in Fig. 5, A to D, and fig. S13. 
Mutations promoting stability in wild-type cells 
were identical to those promoting stability in 
ZER1 mutant cells (Fig. 5, A and B). Therefore, 
these residues comprise the minimal N-terminal 
glycine degron, which is recognized by ZYG1IB. 
Conversely, the ZER1 degron (as revealed in 
ZYGI11B mutant cells) is more extensive because 
mutations that were two or more residues 
downstream of the terminal glycine inter- 
fered with degradation (Fig. 5, C and D). Over- 
all, these data support a model in which both 
ZYG11B and ZERI1 target substrates with ex- 
posed glycine residues at their N-termini; 
however, the recognition motif for ZYG11B is 
relatively small, comprising just the terminal 
glycine and the following residue, whereas 
the recognition motif for ZER1 may extend 
three or more residues along the polypeptide 
chain and preferentially comprises amino 
acids with bulky aromatic side chains. 


N-terminal glycine degrons are depleted 
from metazoan proteomes 


GPS-peptidome technology has already iden- 
tified a suite of degron motifs lying at the C 
terminus of human proteins (73). All of these 
degron motifs are depleted from the human 
proteome (13), suggesting evolutionary pres- 
sure to avoid degradation by E3 ligases that 
target terminal degrons. We thus examined 
the abundance of N-terminal glycine degrons in 
eukaryotic proteomes. As is the case for the 
residue at the extreme C terminus of eukaryotic 
proteins (73), the identity of the residue follow- 
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ing the initiator methionine at the N terminus 
was far more variable than at all neighboring 
positions, suggesting that its properties are par- 
ticularly important (Fig. 5E). Nonetheless, gly- 
cine was encoded at almost exactly the expected 
frequency at the second position across a range 
of metazoan model organisms (Fig. 5F, blue dots). 
However, classifying glycine residues as those fa- 
vored (G followed by F, G, H, L, M, or Y) or dis- 
favored (G followed by D, E, I, N, P, R, S, or T) 
for CRL2-mediated degradation revealed that, 
compared with sequences located internally, 
N-terminal glycine degron motifs are depleted 
from animal proteomes (Fig. 5F, orange dots), 
whereas N-terminal glycine motifs that are not 
efficiently recognized by ZYG11B and ZER1 are 
correspondingly enriched (Fig. 5F, green dots). 
As a control, we performed a similar analysis on 
a panel of reference fungal proteomes, which 
possess Cul2 but no ZYG11B-family ortholog 
(20). Consistent with the idea that there should 
be no selective pressure to avoid N-terminal 
glycine degrons in the absence of Cul27*4® 
and Cul2”"*", no such relationship was observed 
as in animal proteomes (Fig. 5G). Thus, the avoid- 
ance of N-terminal glycine motifs appears to have 
shaped the composition of metazoan proteomes. 


ZYG11B and ZER1 target protein 
fragments bearing N-terminal glycine 
after proteolytic cleavage 


Endoproteolysis generates an additional source 
of terminal degrons (21-23). Caspase cleavage 
preferentially occurs immediately upstream of 
glycine residues (Fig. 6A). Of the ~1800 known 
human caspase cleavage sites, approximately 
one-third result in the exposure of glycine at the 
N terminus of the downstream fragment (24), 
suggesting a potential role for ZYG11B and ZER1 
in the degradation of proteins cleaved during 
apoptosis. Moreover, in contrast to the situation 
at the native N-termini of human proteins (Fig. 
5F), we found that N-terminal glycine degrons 
that favor CRL2-mediated degradation were en- 
riched at caspase cleavage sites (Fig. 6B). 

We used GPS to assess a potential role for 
ZYGI11B and ZER1 in the removal of proteolytic 
fragments. We generated a Ub-GPS peptide li- 
brary in which the 24 residues downstream of 
all caspase cleavage events annotated in Degra- 
base (24) and PROSPER (25) were fused to the 
N terminus of GFP, and we profiled the stability 
of these peptide-GFP fusions in wild-type cells 
versus combined ZYG11B/ZER1 mutant cells 
(Fig. 6C and data file S6). The results confirmed 
that Cul27*C"® and Cul27"*" could target many 
caspase cleavage products: 225 substrates were 
stabilized >0.5 PSI units in both ZYG11B/ZER1 
double-mutant lines, of which 219 (97%) har- 
bored an N-terminal glycine residue (Fig. 6D; 
the GPS profiles of some example substrates are 
shown in Fig. 6E and fig. S14A). 

We validated these findings in two ways. First, 
for a panel of example cleavage products expos- 
ing N-terminal glycine degrons, we verified that 
the full-length protein fragments downstream of 
the cleavage site were stabilized in ZYGIIB/ZER1 


double-mutant cells (Fig. 6F). Second, we dem- 
onstrated that these fragments would also be 
substrates for ZYG11B and ZER1 after endopro- 
teolytic cleavage. Our initial attempts to perform 
these experiments by inducing the dimerization 
of caspase 9 (26) resulted in rapid cell death. 
Therefore, in order to decouple proteolytic cleav- 
age from cell death, we engineered mutant ver- 
sions of four example substrates in which the 
caspase cleavage site was replaced with the 
Tobacco Etch Virus (TEV) protease cleavage 
site (Fig. 6G). TEV protease recognizes the amino 
acid sequence ENLYFQ/G (where “/” represents 
the cleavage position), thus exposing an N-terminal 
glycine on the downstream fragment, and is ac- 
tive when expressed in mammalian cells (27, 28). 
Upon expression of TEV protease, we observed 
destabilization of the downstream cleavage pro- 
ducts that bear N-terminal glycine degrons in 
wild-type cells, but this effect was abrogated in 
ZYGI1B/ZER1 double-mutant cells (Fig. 6G and 
fig. S14B). Thus, ZYG11B and ZER1 are likely to be 
involved in the clearance of proteolytic fragments 
after caspase cleavage during apoptosis. 


ZYGI11B and ZER1 function in the quality 
control of N-myristoylated proteins 


Last, we considered whether the recognition of 
N-terminal glycine degrons might be condi- 
tionally regulated through posttranslational 
modifications. Intriguingly, N-myristoylation, the 
process through which the 14-carbon fatty acid 
myristate is attached to the N terminus of a 
subset of eukaryotic proteins (29), occurs exclu- 
sively on N-terminal glycine (Fig. 7A). Given 
that our mutagenesis experiments showed that 
addition of just a single amino acid to the N 
terminus prevented ZYGIIB- and ZER1-mediated 
recognition, we reasoned that N-myristoylation 
would prevent CRL2-mediated degradation from 
N-terminal glycine. Thus, we hypothesized that 
ZYGI1B and ZER1 might play an important role 
in “myristoylation quality control,” degrad- 
ing proteins that bear N-terminal glycine 
degrons conditionally exposed after a failure of 
N-myristoylation. 

Given that the N-myristoyltransferase enzymes 
(NMT1 and NMT2 in human cells) require less 
than the first 20 residues for substrate recognition 
(29), we reasoned that the peptide-GFP fusion pro- 
teins expressed from our N-terminome Ub-GPS 
library should undergo native N-myristoylation. 
In order to examine the effect of N-myristoylation 
on protein stability, we profiled the N-terminome 
Ub-GPS library in the presence or absence of 
NMT1/2 (Fig. 7B and data file S3C). Although we 
were not able to generate clones in which both 
NMT1 and NMT2 were completely ablated after 
CRISPR/Cas9-mediated gene disruption—a finding 
consistent with the notion that N-myristoylation 
is an essential process (30)—we did isolate three 
clones that retained only residual levels of one NMT 
enzyme as assessed by means of immunoblot (Fig. 
7C). When we analyzed the composition of all the 
peptide-GFP fusion proteins whose stability was 
substantially reduced in all three NMT1/2 mutant 
clones, N-terminal glycine was the most enriched 
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feature (Fig. 7D). Thus, a failure to undergo N- 
myristoylation can lead to instability of the un- 
modified protein. 

To investigate a possible role for ZYG11B and 
ZER1 in this process, we examined the stability 
of a panel of example substrates (fig. SI5A) in 
which N-terminal peptides derived from pro- 
teins known to undergo N-myristoylation (37) 
were expressed in the presence and absence of 
both NMT1/2 and ZYG11B/ZER1. These peptide- 
GFP fusion proteins were efficiently myristoy- 
lated, as evidenced by membrane localization in 
wild-type cells but not in NMT1/2 mutant cells 
(fig. SI5B). Validating the screen results, in each 
case we observed destabilization of the peptide- 


GFP fusion protein upon loss of NMT1/2 (Fig. 7E, 
yellow histograms); moreover, ZYG11B and ZER1 
were primarily responsible for this instability 
because complete or near-complete restabiliza- 
tion was observed upon ablation of both NMT1/2 
and ZYG11B/ZER1 (Fig. 7E, purple histograms). 
The true magnitude of this effect is likely to be 
even greater because addition of the small- 
molecule NMT1/2 inhibitor IMP-1088 (32) to the 
NMT1/2 mutant clones, thus inhibiting the 
residual N-myristoyltransferase activity remain- 
ing in the cell, further enhanced the destabiliza- 
tion of the peptide-GFP substrates (fig. S15C). 
Moreover, the small degree of stabilization ob- 
served with some of the fusion proteins upon 


ablation of ZYG11B and ZER1 in wild-type (that 
is, NMT1/2-sufficient) cells (Fig. '7E, top row) sug- 
gested that some fraction of protein molecules 
do normally escape N-myristoylation, emphasiz- 
ing the necessity for a degradative mechanism 
to remove these aberrant species. 

Last, we wanted to validate that endogenous 
N-myristoylated proteins behaved in a similar 
manner. We observed a significant reduction in 
the steady-state levels of a panel of example 
substrates in NMT1/2 mutant cells, which was 
abrogated upon concurrent ablation of ZYG1IB 
and ZER1 (Fig. 7F). However, unlike the com- 
plete or near-complete stabilization that we ob- 
served using the peptide-GFP fusion constructs 
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Fig. 6. ZYG11B and ZER1 target N-terminal glycine degrons generated 
through endoproteolytic cleavage. (A and B) Caspase cleavage prefer- 
entially generates fragments that bear N-terminal glycine. (A) Logoplot 
depicting the consensus sequence surrounding all caspase cleavage sites 
annotated in Degrabase (24). (B) Compared with their frequency across 
the human proteome, preferred glycine degrons (orange bar) are enriched 
at known caspase cleavage sites, whereas disfavored glycine degrons 
(green bar) are depleted. (C to E) Caspase cleavage events generate 
N-terminal glycine degrons targeted by ZYG11B and ZERI1. (C) Schematic 
representation of the caspase cleavage product Ub-GPS screen. 

(D) Heatmap showing the relative enrichment (red) or depletion (blue) of 
each amino acid across all positions of the 24-amino acid peptide 
comparing peptides stabilized in both ZYG1IB/ZER1 double mutant cells 
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with the whole caspase cleavage site library. (E) Profiles of example 
substrates. Residues flanking the caspase cleavage site (indicated by 
arrows) are shown (fig. S14A). (F and G) CRL2-mediated degradation of 
proteolytic cleavage products bearing N-terminal glycine degrons. (F) The 
full-length downstream caspase cleavage products of the indicated 
proteins were expressed by using the Ub-GPS system, and their stability 
was assessed in wild-type (gray) and dual ZYG11B/ZER1 mutant cells (red) 
by means of flow cytometry. (G) The caspase site in the indicated full- 
length open reading frames (ORFs) was replaced with a TEV protease 
cleavage site. Upon TEV expression (blue histograms), destabilization 

of the downstream cleavage products bearing N-terminal glycine degrons 
was observed in wild-type cells (top row) but not in dual ZYG11B/ZER1 
mutant cells (bottom row) (fig. S14B). 
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(Fig. '7E), here combined mutation of ZYG11B and 
ZER1 only resulted in partial restabilization. Thus, 
in the context of full-length proteins, multiple 
degrons in addition to N-terminal glycine may be 
exposed after a failure of N-myristoylation, ren- 
dering them substrates for additional E3 ligases. 
Altogether, these data demonstrate a physiological 
role for ZYGIIB and ZER1 in the surveillance of 
myristoylated proteins: Successful N-myristoylation 
shields proteins from degradation, but a failure 
to undergo N-myristoylation results in the expo- 
sure of N-terminal glycine degrons and CRL2- 
mediated degradation (Fig. 7G). 


Discussion 


We exploited GPS technology to directly exam- 
ine the contribution of N-terminal sequences to 
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protein stability across the human proteome. 
Unexpectedly, in addition to targeting abnor- 
mal proteins that lack an initiator methionine, 
we discovered that UBR-family E3 ligases also 
targeted proteins with a native N terminus in 
which an arginine or lysine residue follows an 
intact initiator methionine. We also found that 
cysteine exposed at the N terminus of GFP con- 
ferred instability in a UBR-dependent manner. 
Nitric oxide-mediated oxidation of N-terminal 
cysteine renders it a substrate for arginylation 
by ATEI and hence UBR-mediated degradation 
(16). However, here substrates that bear N-terminal 
cysteine were not stabilized to the same extent 
in ATE] mutant cells as in cells that lack UBR 
proteins; thus, if UBR proteins do not directly 
bind N-terminal cysteine, an ATE1-independent 


B Ub-GPS experiment using human N-terminome library 
in wild-type versus NMT1/2 knockout cells 


pathway must exist that permits this class of 
degrons to be recognized by UBR E3 ligases. 
Most notably, we uncovered an additional 
N-degron pathway centered on N-terminal gly- 
cine. There are intriguing mechanistic similar- 
ities between the ZYG11B- and ZER1-mediated 
recognition of N-terminal glycine degrons and the 
KLHDC2-, KLHDC3-, and KLHDC10-mediated 
recognition of C-terminal glycine degrons (13), 
with both processes involving multiple related 
members of CRL2 substrate adaptor families. 
Like the Kelch repeats found in the KLHDC 
family proteins, the leucine-rich repeats and the 
armadillo-like repeats present in the ZYG11 fam- 
ily adaptors also have the propensity to form 
solenoid structures (33), raising the possibility 
of a common structural mode through which 
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Fig. 7. Cul27¥¢"® and Cul27"" target proteins that fail to undergo 
N-myristoylation for proteasomal degradation. (A) N-myristoylation 
occurs on N-terminal glycine. (B) Schematic representation of the 

GPS screen designed to assess the effect of loss of N-myristoylation on 
protein stability. (C) Immunoblot validation of NMT1/2 knockout clones. 
Arrowheads indicate bands of the expected molecular weight. (D) Loss of 
N-myristoylation destabilizes peptide-GFP fusions that start with glycine. 
The heatmap shows the relative enrichment (red) or depletion (blue) 

of each amino acid across all positions of the 24-amino acid peptide, 
comparing the 91 peptides that exhibit substantial destabilization in all 
three NMT1/2 mutant clones relative to the whole N-terminome library. 
(E and F) ZYG11B and ZERI1 target N-terminal glycine degrons exposed 
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after a failure of N-myristoylation. (E) The first 24 amino acids from the 
indicated proteins were expressed as N-terminal fusions to GFP, and 
their stability in the indicated genetic backgrounds was measured by 
means of flow cytometry. The destabilization observed upon loss of 
NMT1/2 (gold histograms) is rescued in the absence of ZYG11B and ZER1 
(purple histograms). (F) The abundance of the indicated myristoylated 
proteins was assessed by means of immunoblot in either control (AAVS1) 
or ZYG11B/ZER1 double-mutant cells (DKO), either with or without 
simultaneous ablation of NMT1/2. Src, which did not exhibit significant 
destabilization in NMT1/2 mutant cells in the GPS screen, is shown as a 
negative control. (G) Model depicting the role of Cul27”S48 and Cul27ER 
in the quality control of N-myristoylated proteins. 
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terminal glycine residues are engaged (34). Fur- 
thermore, like their C-terminal counterparts, 
the ZYGI11 family of substrate adaptors have 
also shaped the proteome, with N-terminal 
glycine degrons being broadly avoided across 
metazoa. 

Our data suggests two contexts in which 
the targeting of N-terminal glycine degrons 
may play an important physiological role. N- 
myristoylation is a posttranslational modifi- 
cation that regulates the membrane localization 
and other properties of several hundred human 
proteins (29), a group that comprises notable 
members including Arf family GTPases, G pro- 
tein o subunits, and Src family tyrosine kinases 
(31). We propose a model in which a failure of 
N-myristoylation conditionally exposes N-terminal 
glycine degrons to ZYG11B and ZERI1, which 
are normally occluded upon successful modifi- 
cation. Further work will be required to ascertain 
whether other classes of terminal degrons func- 
tion in analogous quality control pathways to 
ensure the efficient deposition of posttransla- 
tional modifications. 

Furthermore, the strong enrichment for favored 
ZYG11B/ZER1 glycine degrons at the N-termini 
of known caspase cleavage products suggested a 
potential role for these CRL2 complexes during 
apoptosis, and we confirmed experimentally that 
many caspase cleavage events would generate 
substrates efficiently degraded by Cul27*0"8 and 
Cul2“"8", Following glycine, the next most com- 
monly generated N-terminal residue after caspase 
cleavage is serine, accounting for ~28% of anno- 
tated caspase sites. Intriguingly, in complete 
contrast to glycine, serine is the most stabilizing 
residue when exposed at the N terminus. Our 
caspase cleavage product GPS screen showed 
that fragments bearing N-terminal serine were 
generally extremely stable (fig. S14C). This may 
be useful where caspases need to activate a 
target, such as in the case of ataxia-telangiecta- 
sia mutated (ATM), whose C-terminal cleavage 
product acts in a dominant-negative manner 
to prevent DNA repair during apoptosis (35), 
or RAD21, whose C-terminal cleavage prod- 
uct acts as a pro-apoptotic factor (36). 

The importance of this glycine-specific N- 
degron pathway to human biology is underscored 
by the frequency of heterozygous loss-of-function 
mutations in humans for both ZYG1IB and ZER1 
being far lower than would be predicted. ZYG11B 
and ZERI1 both have a pLi value of 1 in the ExAC 
database (37), indicating that loss-of-function 
variants are strongly selected against in the 
heterozygous state, demonstrating potent haplo- 
insufficiency and counter selection in humans. 
Misregulation of Src-family tyrosine kinases could 
be deleterious to development. In Caenorhabditis 
elegans, the ZYGI11 ortholog is required for the 
metaphase-to-anaphase transition and M phase 
exit at meiosis IT (20, 38, 39). In humans, ZYG11B 
and ZER1 are both expressed in the testes and 
ovaries, and hence a similar role in the regula- 
tion of meiosis could also explain the strong 
selection against loss-of-function mutations. Alto- 
gether, the comprehensive analysis of N-terminal 
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degrons presented here has illuminated multiple 
aspects of N-degron proteolytic pathways and 
revealed that a family of E3 ligases specific for 
N-terminal glycine has shaped the human 
proteome. 


Materials and methods 
Cell Culture 


HEK-293T (ATCC” CRL-3216) cells were grown 
in Dulbecco’s Modified Eagle’s Medium (DMEM) 
(Life Technologies) supplemented with 10% fetal 
bovine serum (HyClone) and penicillin/streptomycin 
(Thermo Fisher Scientific). 


Transfection and lentivirus production 


Lentivirus was generated through the transfec- 
tion of HEK-293T cells using PolyJet In Vitro 
DNA Transfection Reagent (SignaGen Labora- 
tories). Cells seeded at approximately 80% con- 
fluency were transfected as recommended by 
the manufacturer with the lentiviral transfer 
vector plus four plasmids encoding Gag-Pol, 
Rev, Tat and VSV-G. The media was changed 
24 hours post-transfection and lentiviral super- 
natants collected a further 24 hours later. Cell 
debris was removed by centrifugation (800x g, 
5 min) and virus was stored in single-use aliquots 
at -80°C. Transduction of target cells was achieved 
by adding the virus in the presence of 8 ug/ml 
hexadimethrine bromide (Polybrene, Sigma- 
Aldrich). 


Inhibitors 


The proteasome inhibitor Bortezomib was ob- 
tained from APExBio and the pan-CRL inhibitor 
MLN4924 was obtained from Active Biochem; 
both were used at a final concentration of 1 uM. 
The NMT1/2 inhibitor IMP-1088 was purchased 
from Cayman Chemical and was used at a final 
concentration of 1 uM for 24 hours. 


Antibodies 


Primary antibodies used in this study were: 
rabbit anti-UBR1 (Bethyl, A302-988A), rabbit anti- 
UBR2 (Bethyl, A305-416A), rabbit anti-UBR4 
(Bethyl, A302-278A), rabbit anti-GFP (Abcam, 
ab290), rabbit anti-FOXJ3 (Bethyl, A303-107A), 
rabbit anti-ALKBH1 (Abcam, ab195376), rabbit 
anti-CHMP3 (Bethyl, A305-397A), mouse anti- 
vinculin (Sigma, V9131), rabbit anti-Fyn (Cell 
Signaling, 4023T), rabbit anti-LAMTOR (Cell Sig- 
naling, 8975T), rabbit anti-Yes (Cell Signaling, 
3201S), rabbit anti-Lyn (Bethyl, A302-683A-T), 
rabbit anti-NDUFAF4 (ABclonal, A14345), rab- 
bit anti-Sre (Cell Signaling, 2123T) and rabbit 
anti-GAPDH (Cell Signaling, 5174). The HA and 
FLAG epitope tags were detected using rat 
anti-HA peroxidase (Sigma-Aldrich, 12013819001) 
and rabbit anti-FLAG peroxidase (Cell Signaling, 
#20445), respectively. HRP-conjugated goat anti- 
mouse IgG and goat anti-rabbit IgG secondary 
antibodies were obtained from Jackson Immuno- 
Research (#111-035-003). 


Plasmids 


Lentiviral vectors encoding dominant-negative 
Cullin constructs were a generous gift from W. 


Harper. For exogenous expression of CRL2 sub- 
strate adaptors, the pDHRSIN-Psrry-GFP-WPRE- 
Ppex-Hygro vector was used (a gift from P. Lehner), 
with the constructs cloned in place of GFP using 
the Gibson assembly method (NEBuilder HiFi 
Cloning Kit). Plasmids encoding ZYGI1A and 
ZYGI1I1B were obtained from Addgene [plasmids 
#110550 and #110551, deposited by E. Kipreos 
(40)], while an entry vector encoding ZER1 was 
obtained from the Ultimate ORF Clone collec- 
tion (Thermo Fisher Scientific). A plasmid encod- 
ing TEV protease was also obtained from Addgene 
[plasmid #64276, deposited by X. Shu (27)]. 

For individual CRISPR/Cas9-mediated gene 
disruption experiments, the lentiCRISPR v2 vec- 
tor was used (Addgene #52961, deposited by 
Feng Zhang). Oligonucleotides encoding the 
top and bottom strands of the sgRNAs were 
synthesized (IDT), annealed and phosphorylated 
(T4: PNK; NEB) and cloned into the lentiCRISPR. 
v2 vector as described (47). Nucleotide sequences 
of the sgRNAs used were: 

sg-AAVS1: GGGGCCACTAGGGACAGGAT 

sgl-UBR1: GTGAGAGGATGGAAATCAGCG 

sg2-UBR1: GATTCTAACTTGTGGACCGAA 
sgl-UBR2: GAGGAGGAGAGAAGATGGCGT 
sg2-UBR2: GTACCCAAAATCTACTGCAG 
sgl-UBR4: GCCTCTCGAAGATGAACACCG 
sg2-UBR4: GCTGACCCCTGGACAGACAG 
sg-ATE1: GTATCAGGATCTCATAGACCG 
sgl-ZYG11B: GCGCTCGTAAGGATCCTCGA 
sg2-ZYG1IIB: GAAGCTCGAAGGCCAGAAAGC 
sgl-ZER1: GTATGAGGAGGAGAACCCAGG 
sg2-ZER1: GCCGCAGCAGGGACTCCACA 
sgl-NMT1: GCAGGGTGTAGGGCTCCTGG 
sg2-NMT1: GAAGCTCTACCGACTGCCAG 
sgl-NMT2: GATAGACGGGGACAATGAGG 
sg2-NMT2: GGACACGTGCGGGATAGACG 


Flow cytometry 


Analysis of HEK-293T cells by flow cytometry was 
performed on a BD LSRII instrument (Becton 
Dickinson) and the resulting data was analyzed 
using FlowJo. Cell sorting was performed on a 
MoFlo Astrios (Beckman Coulter). 


Generation of Ub-GPS libraries 


Human protein coding sequences were down- 
loaded from the Gencode database (release 27). 
The first 72 nucleotides of entries that (i) started 
with a methionine residue, (ii) had a transcript_ 
support_level equal to 1 or 2 and (iii) were com- 
mon to both the Ensembl and Havana databases 
were included in the oligonucleotide design. After 
removal of identical 72-nucleotide sequences, 
the final N-terminome library consisted of a total 
of 24,638 sequences. The oligonucleotide pool 
was synthesized by Agilent Technologies and 
amplified by PCR (Q5 Hot Start High-Fidelity 
DNA Polymerase, NEB). The PCR product was 
then cloned into the Ub-GPS vector between a 
unique SalI site engineered into the 3’ end of 
the ubiquitin gene and a unique NdeI site at the 
5’end of GFP using the Gibson assembly method 
(NEBuilder HiFi Cloning Kit), such that the 
resulting vector encoded the peptides immedi- 
ately downstream of ubiquitin, followed by a 


10 of 13 


RESEARCH | RESEARCH ARTICLE 


short linker (ATSALGT) and GFP (commencing 
SKGEEL-). At least 100-fold representation of 
the library was maintained at each step. 

Ub-GPS libraries for saturation mutagenesis 
were generated in an identical manner. For 
each peptide selected for analysis, each amino 
acid encoded at position 2 through to position 
10 was mutated to all other possible 19 amino 
acids. For each peptide 9 reference sequences 
were also synthesized, in which the same wild- 
type amino acid sequence was encoded by dif- 
ferent nucleotide sequences. 

Two databases were used to collate cleavage 
products for the caspase cleavage site Ub-GPS 
library: Degrabase (24) and PROSPER (25). All 
annotated cleavage sites in human proteins oc- 
curring after aspartic acid in each databases 
were included for oligonucleotide design, which, 
after removal of duplicates, resulted in a total of 
2,234 sequences. Amino acid sequence were con- 
verted into nucleotide sequences using the fol- 
lowing codons: 

A: GCC, C: TGC, D: GAC, E: GAG, F: TTC, G: GGC, 
H: CAC, I: ATC, K: AAG, L: CTG, M: ATG, N: AAC, 
P: CCC, Q; CAG, R: AGA, S: TCC, T: ACC, V: GTG, 
W: TGG, and Y: TAC. The oligonucleotide pool was 
synthesized by Agilent Technologies and amplified 
and cloned into the Ub-GPS vector as above. 


Ub-GPS screens 


GPS plasmid libraries were packaged into lenti- 
viral particles which were used to transduce HEK- 
293T cells at a multiplicity of infection of ~0.2 
(achieving approximately 20% DsRed* cells) and 
at sufficient scale to achieve ~500-fold coverage 
of the library (a total of ~12 million transduced 
cells in the case of the human N-terminome li- 
brary). Puromycin (1.5 ug/ml) was added two days 
post-transduction to eliminate untransduced 
cells. Surviving cells were pooled, expanded 
and then partitioned by FACS into six bins 7 days 
post-transduction based on the GFP/DsRed ra- 
tio. Genomic DNA was extracted from each of 
the pools separately (Gentra Puregene Cell Kit, 
Qiagen) and the fusion peptides amplified by 
PCR (Q5 Hot Start Polymerase, NEB) using a 
forward primer annealing to the end of the ubi- 
quitin gene and a reverse primer annealing to the 
front of GFP; sufficient reactions were performed 
to amplify a total mass of DNA equivalent to the 
mass of genomic DNA from cells representing 
500-fold coverage of the library. All PCR products 
were pooled, and one-tenth of the mix was puri- 
fied using a spin column (Qiagen PCR purifica- 
tion kit). Finally, 200 ng of the purified PCR 
product was used as the template for a second 
PCR reaction using primers to add the Illumina 
P5 sequence and a 7 bp ‘stagger’ region to the 5’ 
end, and Illumina indexes and P7 sequence at 
the 3’ end. Samples to be multiplexed were then 
pooled, purified on an agarose gel (QIAEXII Gel 
Extraction Kit, Qiagen) and sequenced on an 
Illumina NextSeq instrument. 


CRISPR screens 


A custom sgRNA library was designed targeting 
43 E2 enzymes, 11 core CRL components, and 
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109 CRL2/5 adaptors at a depth of 6 sgRNAs per 
gene. The sgRNA sequences together with flank- 
ing BbsI restriction enzyme recognition sites 
were synthesized by Twist Bioscience. The oligo- 
nucleotide pool was amplified by PCR (Q5 Hot 
Start Polymerase, NEB) and the product purified 
(Qiagen PCR purification kit) and digested with 
BbsI (NEB). The digested product was concen- 
trated by ethanol precipitation and then visual- 
ized on a 10% TBE PAGE gel (Thermo Fisher 
Scientific) stained with SYBR Gold (Thermo 
Fisher Scientific). DNA was isolated from the 
28 bp band using the “crush-and-soak” method, 
concentrated by ethanol precipitation, and then 
cloned into lentiCRISPR v2 (Addgene #52961) 
digested with BsmBI (NEB). 

The sgRNA library DNA was packaged into 
lentiviral particles. HEK-293T cells stably ex- 
pressing unstable peptide-GFP fusion proteins 
were transduced at a multiplicity of infection 
of ~0.3 at sufficient scale to maintain at least 
1000-fold representation of the library. Untrans- 
duced cells were eliminated through puromycin 
selection commencing two days post-transduc- 
tion. The top ~5% of the surviving cells based on 
the GFP/DsRed ratio were isolated by FACS, 
which was performed 7 days post-transduction. 
For each screen genomic DNA was extracted 
from both the sorted cells and the unselected 
library as a reference. The sgRNAs in both pools 
were amplified by PCR and sequenced on an 
Illumina NextSeq instrument. 


Immunoprecipitation and 
immunoblotting 


HEK-293T cells stably expressing epitope-tagged 
CRL2 substrate adaptors and peptide-GFP fusions 
were grown in 10 cm plates. Following treatment 
with Bortezomib (1 uM, 5 hours), cells were lysed 
in ice-cold lysis buffer (50 mM Tris, 100 mM NaCl, 
0.5% NP-40, pH 7.5 supplemented with EDTA-free 
protease inhibitor tablet and Phos-Stop phospha- 
tase inhibitor tablet (Roche)) for 30 min on ice. 
Nuclei were pelleted by centrifugation (14,000x g, 
10 min, 4°C). Beads coated with anti-HA (Pierce 
anti-HA magnetic beads, Thermo Fisher Scientific) 
or anti-FLAG (anti-FLAG M2 magnetic beads, 
Sigma-Aldrich) antibodies were added to the 
supernatants and incubated with rotation over- 
night at 4°C. The beads were then washed three 
times with lysis buffer before bound proteins 
were eluted upon incubation with SDS-PAGE 
sample buffer (95°C, 10 min). Proteins were 
subsequently resolved by SDS-PAGE (NuPAGE 
Bis-Tris gels, Thermo Fisher Scientific) and 
transferred to a nitrocellulose membrane (Trans- 
Blot Turbo System, Bio-Rad) which was then 
blocked in 10% nonfat dry milk in PBS + 0.1% 
Tween-20 (PBS-T). The membrane was incu- 
bated with primary antibody overnight at 4°C, 
and then, following three washes with PBS-T, 
HRP-conjugated secondary antibody was added 
for 1 hour at room temperature. Following a 
further three washes in PBS-T, reactive bands 
were visualized using Western Lightning Plus 
ECL (Perkin Elmer) and HyBlot CL film (Den- 
ville Scientific). 


Mass spectrometry 
UBR KO clone 2 cells stably expressing peptide- 
GFP fusions growing in 15 cm plates were lysed 
as described above, and immunoprecipitation 
performed in a similar way using GFP-Trap_MA 
magnetic agarose beads (Chromotek). Elution 
of the peptide- GFP fusion proteins was achieved 
by treatment with 2 M glycine for 1 min, followed 
by neutralization with 1 M Tris base, pH 10.4. 
Eluted proteins were reduced using DTT (Thermo 
Fisher) and alkylated with iodoacetamide (Sigma). 
Following TCA precipitation (Sigma), proteins 
were digested with Glu-C (Thermo Fisher) then 
cleaned up on C-18 stage tips (3M). 

Mass spectrometry data were collected using 
a Q Exactive mass spectrometer (Thermo Fisher) 
coupled with a Famos Autosampler (LC Pack- 
ings) and an Accela600 liquid chromatography 
pump (Thermo Fisher). Peptides were separated 
on a 100 um inner diameter microcapillary col- 
umn packed with ~25 cm of Accucore C18 resin 
(2.6 um, 150 A, Thermo Fisher). Peptides were 
separated using a 120 gradient of 5 to 25% 
acetonitrile in 0.125% formic acid at a flow rate 
of ~300 nl/min. The scan sequence began with 
an Orbitrap MSI spectrum with resolution 70,000, 
scan range 300-1500 Th, automatic gain control 
(AGC) target 1 x 10°, maximum injection time 
250 ms, and centroided data type. The top twenty 
precursors were selected for MS2 analysis which 
consisted of HCD (high-energy collision dissocia- 
tion) with the following parameters: resolution 
17,500, AGC 1 x 10°, maximum injection time 
100 ms, isolation window 1.6 Th, normalized col- 
lision energy (NCE) 27, and centroid spectrum 
data type. Unassigned charge states were excluded 
from MS” analysis, but singly charged species were 
included. Dynamic exclusion was set to automatic. 
Mass spectra were processed using a Sequest- 
based in-house software pipeline. 


Bioinformatics 
N-terminome Ub-GPS screen 


Raw Illumina reads derived from each GPS bin 
were first trimmed of constant sequences derived 
from the Ub-GPS vector backbone using Cutadapt 
(42). Resulting 72 nt reads were mapped to the 
reference input library using Bowtie 2 (43), and 
count tables were generated from reads that 
aligned perfectly to the reference sequence. Fol- 
lowing correction for sequencing depth, the pro- 
tein stability index (PSI) metric was calculated for 
each peptide-GFP fusion. The PSI score is given 
by the sum of multiplying the proportion of reads 
in each bin by the bin number (1-6 in this case), 
thus yielding a stability score between 1 (maxi- 
mally unstable) and 6 (maximally unstable): 
PSI = ae R; « i(where 7 represents the num- 


ber of the bin and R; represents the proportion 
of Illumina reads present for a peptide in that 
given bin 7). Read counts and associated stab- 
ility score for each peptide-GFP fusion are de- 
tailed in data file S1. 


Prediction of destabilizing N-terminal motifs 


The stability data derived from the Ub-GPS N- 
terminome screen was used to identify potential 
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destabilizing N-terminal degron motifs (Fig. 2, C 
and D). Varying exactly two residues at a time 
between position 2 and position 7, for all possible 
combinations of di-peptide motifs (allowing gaps) 
the mean PSI of all peptides containing that motif 
at the front of the peptide (that is, immediately 
following the initiator methionine) was compared 
to the mean PSI of all peptides containing the 
same motif at any internal location within the 
24-nucleotide oligomer peptide. The full data 
for all possible N-terminal motifs is tabulated in 
data file S2. 


N-terminome Ub-GPS screen in different 
genetic backgrounds 


The Ub-GPS N-terminome screens with the UBR 
mutant clones and the ZYG1IB and ZER1 mutant 
cells were performed in a similar manner as above, 
except that only the half of the N-terminome 
library comprising peptides bearing an initiator 
methionine was used. Read counts for all peptide- 
GFP fusions are detailed in data file S3A. Sub- 
sequently, comparisons between wild-type cells 
and combined ZYG11B/ZER1 mutant cells (data 
file S3B) and between wild-type, control (AAVS1) 
mutant and three NMT1/2 mutant clones (data 
file S3C) were performed in a similar way. In each 
case, a APSI score was generated for each peptide- 
GFP fusion reflecting the difference in raw PSI 
scores between the wild-type or control mutant 
cells and the experimental mutant cells. For the 
plots shown in Fig. 3 and fig. S4, peptide-GFP 
fusions were defined as UBR substrates if they 
were stabilized >0.8 PSI units (Fig. 3E) or >0.6 PSI 
units (fig. S4, A to D) in the UBR KO clone 
compared to control cells, but also not stabilized 
>0.3 PSI units in either ZYG1IB or ZER1 mutant 
cells. The logoplots shown in fig. $4, E to H were 
generated with iceLogo (44) and rendered using 
Seq2Logo (45): the “experimental set” comprised 
all peptides starting with the indicated motif that 
were identified as a UBR substrate in any of the 
three UBR KO clones, the “reference set” com- 
prised all peptides in the N-terminome library 
starting with that same motif, and the “percentage 
difference” scoring system was used. Residues sig- 
nificantly enriched at P < 0.05 are displayed. For 
the heatmap shown in Fig. 7D, peptide-GFP 
fusions destabilized >0.5 PSI units in all three 
NMT1/2 mutant clones were included for analysis. 


Saturation mutagenesis Ub-GPS screens 


The heatmaps displayed in Figs. 3, G to I, and 4B 
and figs. S5, A to C, and S6 illustrate the dif- 
ference between the PSI for each individual 
mutant peptide and the median PSI of all the 
unmutated peptides; the darker the red color, 
the greater the stabilizing effect of the muta- 
tion. For the heatmaps shown in Fig. 5, A to C, 
and fig. S13 the color scales indicate the raw PSI 
stability measurement, which lies between 1 (max- 
imally unstable; dark blue) and 6 (maximally 
stable; dark red); the exception is the compar- 
ison between ZYG11B mutant and ZER1 mutant 
cells (right columns) where a APSI score reflect- 
ing the difference between raw PSI score in 
ZYGI11B mutant cells and ZER1 mutant cells for 
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each peptide is depicted. The full data for all 
mutant peptides in all genetic backgrounds is 
detailed in data file S4, A to C. 


CRISPR screens 


Constant regions derived from the backbone of 
the lentiCRISPR v2 expression vector were re- 
moved from Illumina reads using Cutadapt, and 
count tables were generated from the remain- 
ing variable portion of the sgRNA sequences 
using Bowtie 2. The Model-based Analysis of 
Genome-wide CRISPR/Cas9 Knockout (MAGeCK) 
algorithm (46) was used to rank the performance 
of individual genes targeted by multiple sgRNAs 
enriched in the selected cells versus the unsorted 
populations. The full MAGeCK output for each 
screen is detailed in data file S5. For the scat- 
terplots shown in Fig. 4E and fig. SSB, the MAGeCK 
score plotted on the y axis is calculated as the 
negative loglO of the “pos|score” value gener- 
ated by MAGeCK. 


Proteome composition analysis 


Canonical protein sequences were downloaded 
from the Swiss-Prot database. For each position 
between position 2 and position 10 at the N 
terminus of the proteins the total abundance 
of each amino acid was quantified, expressed 
as a proportion of the total number of protein 
sequences, and then normalized to the mean 
proportion observed across the 9 N-terminal 
residues between position 2 and position 10. 
For the analysis of N-terminal glycine degrons 
shown in Fig. 5, F and G, we further categorized 
glycine residues as either “favored” for CRL2- 
mediated degradation through ZYG11B and ZER1 
if they were followed by F, G, H, L, M or Y, or 
“disfavored” if followed by D, E, I, N, P, R, S or T. 
The mean abundance of Gryorea aNd Gaistavorea 
across the 8 N-terminal residues between position 
3 and position 10 was then compared to their 
abundance at position 2. For the analysis of 
human caspase cleavage sites presented in Fig. 
6A, all unique cleavage sites occurring down- 
stream of aspartic acid (D) annotated in Degrabase 
1.0 (24) were analyzed using iceLogo. In Fig. 6B, all 
unique cleavage sites occurring downstream of 
aspartic acid (D) and upstream of glycine (G) 
were analyzed; the frequency of Grayoreq and 
Gaisfavorea at the N terminus of these caspase 
cleavage products was compared to the frequency 
Of Gaworea ANA Gaisfavorea at all glycine residues in 
the human proteome. 


Caspase cleavage site Ub-GPS screen 


The Ub-GPS screen with peptides derived from 
human caspase cleavage sites was analyzed in 
the same way as above, generating a APSI score 
for each peptide reflecting the difference in raw 
PSI scores between either control (AAVS1) mu- 
tant cells or combined ZYG11B/ZER1 double 
mutant cells versus wild-type cells (data file S6). 
For the heatmap shown in Fig. 6D, peptide-GFP 
fusions stabilized >0.5 PSI units in both ZYG11B/ 
ZER1 double mutant cell lines but <0.25 PSI 
units in control knockout cells were included; 
the intensity of the colors represent the depletion 


(blue) or enrichment (red) of each amino acid 
comparing this pool of ZYG11B/ZER1 substrates 
to all peptides detected in the caspase cleavage 
site library. 
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INTRODUCTION: The development of precise 
neural circuits is initially directed by genetic 
programming and subsequently refined by 
neural activity. In the mouse olfactory system, 
axons from various olfactory sensory neurons 
expressing the same olfactory receptor converge 
onto a few spatially invariant glomeruli, gen- 
erating the olfactory glomerular map in the 
olfactory bulbs. During development, olfactory 
receptors instruct axon sorting to form discrete 
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glomeruli. Olfactory receptors generate a com- 
binatorial code of axon-sorting molecules whose 
expression is regulated by neural activity. How- 
ever, it remains unclear how neural activity 
induces olfactory receptor-specific expression 
patterns of axon-sorting molecules. 


RATIONALE: The prevailing model for the 
activity-dependent development of neural cir- 
cuits postulates an interaction between pre- 
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Firing pattern—dependent olfactory map formation. Top left: Diverse patterns of 
spontaneous neural activity in olfactory sensory neurons. Bottom left: A combinatorial 
expression pattern of axon-sorting molecules at axon termini of olfactory sensory neurons. 
Right: A model for activity-dependent olfactory map formation. OE, olfactory epithelium; OB, 
olfactory bulb. Axon-sorting molecules for glomerular segregation: red, Kirrel2 (kin of 
IRRE-like protein 2); green, Sema7A (semaphorin 7A); blue, PCDH10 (protocadherin 10). 
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and postsynaptic neurons. In Hebbian plasticity, 
the correlated activity of pre- and postsynaptic 
neurons strengthens synaptic connections, where- 
as uncorrelated activity or lack of activity weakens 
them. However, this theory does not explain 
activity-dependent mechanisms in olfactory map 
formation. Axons of olfactory sensory neurons 
can converge to form glomerular-like structures 
even in mutant mice lacking synaptic partners, 
suggesting another activity-dependent mech- 


anism for glomerular seg- 
regation. The involvement 
Read the full article of neural activity in olfac- 


tory map formation has 
been demonstrated by ex- 
perimental suppression of 
neural activity. Here, we 
asked how neural activity is involved in the ex- 
pression of axon-sorting molecules regulating 
glomerular segregation. 


at http://dx.doi. 
org/10.1126/ 
science.aaw5030 


RESULTS: We performed calcium imaging 
experiments and optogenetic stimulation to 
address how neural activity generates olfactory 
receptor-specific expression patterns of axon- 
sorting molecules. Calcium imaging of olfactory 
sensory neurons revealed that the temporal 
patterns of spontaneous neuronal spikes were 
not spatially organized, but rather were cor- 
related with the olfactory receptor types. Re- 
ceptor substitution experiments demonstrated 
that olfactory receptors determine spontaneous 
activity patterns. Moreover, optogenetically dif- 
ferentiated patterns of neuronal activity induced 
expression of corresponding axon-sorting mole- 
cules and regulated glomerular segregation. 


CONCLUSION: We have demonstrated an in- 
structive role of neural activity in olfactory map 
formation. We propose an activity-dependent 
mechanism, different from Hebbian plasticity 
theory, in which specific patterns of sponta- 
neous activity determined by the expressed 
olfactory receptor type contribute to gener- 
ating the combinatorial code of axon-sorting 
molecules for olfactory receptor-specific axon 
sorting. 

Neural activity is involved in various aspects 
of brain development and function. Our find- 
ings show that in the olfactory system, gene 
expression that regulates neural circuit for- 
mation is dependent on neural firing patterns. 
With this strategy, neurons can generate varia- 
tion through diversifying gene expression. The 
pattern-dependent gene regulation may also 
expand beyond development to plastic changes 
in neural circuits throughout the lifetime. 


The list of author affiliations is available in the full article online. 
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Neural circuits emerge through the interplay of genetic programming and activity- 
dependent processes. During the development of the mouse olfactory map, axons 
segregate into distinct glomeruli in an olfactory receptor (OR)—dependent manner. ORs 
generate a combinatorial code of axon-sorting molecules whose expression is regulated 
by neural activity. However, it remains unclear how neural activity induces OR-specific 
expression patterns of axon-sorting molecules. We found that the temporal patterns 

of spontaneous neuronal spikes were not spatially organized but were correlated with 
the OR types. Receptor substitution experiments demonstrated that ORs determine 
spontaneous activity patterns. Moreover, optogenetically differentiated patterns 

of neuronal activity induced specific expression of the corresponding axon-sorting 
molecules and regulated axonal segregation. Thus, OR-dependent temporal patterns 

of spontaneous activity play instructive roles in generating the combinatorial code 

of axon-sorting molecules during olfactory map formation. 


n the mammalian brain, the development of 

precise neural circuits is initially directed by 

intrinsic genetic programming and subse- 

quently refined by neural activity (7-3). In 

the mouse olfactory system, individual olfac- 
tory sensory neurons (OSNs) express a single 
gene of a functional olfactory receptor (OR) out 
of >1000 OR genes (4-6). Axons from various 
OSNs expressing the same OR converge onto a 
few spatially invariant glomeruli, generating 
the olfactory glomerular map in the olfactory 
bulbs (OBs) (7-9). The olfactory glomerular map 
forms through two processes: global targeting 
and glomerular segregation (10, 11). OSN axons 
are first guided to approximate target regions 
according to gradients of axon guidance mol- 
ecules (72, 13) and are subsequently sorted into 
specific glomerular structures in an activity- 
dependent manner (11, 14-16). In glomerular seg- 
regation, OSN axons discriminate expressed OR 
types. Substitution of an OR-coding DNA se- 
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quence with another sequence results in allopat- 
ric generation of glomeruli that are distinct from 
the original ones (17, 18). Further, suppression of 
spontaneous neural activity disrupts the glomer- 
ular segregation (15). However, it remains a mys- 
tery how neural activity mediates the OR-specific 
glomerular segregation. 

The most prevailing model for the activity- 
dependent development of neural circuits postu- 
lates the interaction between pre- and postsynaptic 
neurons. In Hebbian plasticity (19), the corre- 
lated activity of pre- and postsynaptic neurons 
strengthens synaptic connections, whereas un- 
correlated activity or lack of activity weakens 
them. However, this theory does not explain 
activity-dependent mechanisms for axon sorting 
before the formation of synapses. OSN axons are 
capable of converging to form glomerular-like 
structures even in mutant mice lacking synaptic 
partners (20, 27). These findings suggest another 
activity-dependent mechanism for glomerular 
segregation. 

OR molecules control the expression of vari- 
ous axon-sorting molecules, which serve to regu- 
late glomerular segregation through their adhesive 
or repulsive interactions (22-26). These molecules 
exhibit both mosaic and glomerular-specific ex- 
pression patterns (27). However, their spatial pat- 
terns are not identical in the OB. As a result, OR 
identity is represented as a unique combinatorial 
code of axon-sorting molecules at the axon ter- 
mini, which provides the self-identification tags 
for OR-specific glomerular segregation. Because 
the expression of axon-sorting molecules is activity- 
dependent (22, 25), neural activity is expected to 
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bridge between the OR types and the combina- 
torial expression of the axon-sorting molecules. 
Previous studies reported that spontaneous fir- 
ing rates are different among OSN classes in 
several species (28-32). However, it remains un- 
clear whether neural activity functions in an in- 
structive way or a permissive way; it is not clear 
whether specific features of neural activity pat- 
terns are important or whether the presence of 
neural activity is sufficient to trigger a predeter- 
mined gene expression. Here, we applied genetic 
approaches to address how neural activity gener- 
ates OR-specific expression patterns of axon- 
sorting molecules. 


Activity-dependent expression of 
axon-sorting molecules 


On the basis of our previous study (27), we fo- 
cused on Kirrel2, Sema7A, and PCDH10 as the 
axon-sorting molecules for glomerular segrega- 
tion. Genetic knockout (KO) experiments dem- 
onstrated that these molecules are involved in 
neural circuit formation in the mouse olfactory 
system (22, 25, 33) (fig. S1). The expression levels 
of these molecules are uniquely correlated with 
OR types (22, 24, 27). However, the expression 
patterns differed among glomeruli in the OB 
(Fig. 1A). For example, Kirrel2 expression was 
high in the I7 glomeruli but low in the MOR28 
and M71 glomeruli. In contrast, PCDH10 expres- 
sion was high in the M71 glomeruli, middle in 
the I7 glomeruli, and low in the MOR28 glo- 
meruli (Fig. 1, B and C). Our previous study 
showed that the expression levels of these mole- 
cules dropped substantially to undetectable levels 
in KOs of the cyclic nucleotide-gated (CNG) 
channel (27), a key component of the olfactory 
signal transduction pathway (34). High-KCl treat- 
ment of olfactory tissues in ex vivo culture condi- 
tions increased the expression of these genes, and 
this effect was blocked by the depletion of extra- 
cellular calcium ions or inhibition of the voltage- 
gated L-type calcium channel (fig. $2). These 
results indicate that calcium influx associated 
with neural activity is required for generating the 
combinatorial code of the axon-sorting molecules. 


Visualizing OSN spontaneous activity 


To examine the nature of OSN spontaneous ac- 
tivity, we used a Cre/loxP approach [pGoofy-Cre: 
pCAG-LSL-GCaMP6f (Ai95D) mice] to express 
the genetically encoded calcium indicator GCaMP6f 
(35) specifically in OSNs (Fig. 2A). In the OSN- 
specific GCaMP6f mice, action potentials were 
reliably detected as transient increases in the 
fluorescence intensities of GCaMPé6f (fig. S3). 
Correlation analyses revealed that spontaneous 
calcium events were not spatially or temporally 
correlated between OSNs in ex vivo acute olfac- 
tory epithelium (OE) slices from neonatal mice 
(Fig. 2, A and B; see also movie S1), which sug- 
gested that spontaneous calcium transients are 
sporadic events and occur independent of neigh- 
boring OSNs. These results are in contrast to the 
spontaneous activity patterns observed in the 
developing retina, which generates temporally 
correlated activity among neighboring retinal 
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Fig. 1. OR-specific 
expression patterns of 
axon-sorting molecules. 
(A) Glomerulus-specific 
expression of axon-sorting 
molecules. A parasagittal 
OB section from a 
2-week-old mouse was 
immunostained using 
antibodies to Kirrel2 (red), 
PCDH1O (blue), and 
Sema7A (green). Enlarged 
images are shown at the 
right. AOB, accessory 
olfactory bulb. Scale bars, 
100 um. (B) Expression 
levels of axon-sorting 
molecules in OSNs 
expressing specific ORs. 
The glomerular locations 
of MOR28-, I7-, and B 
M71-expressing OSNs 
were identified by immuno- 
staining with the indicated 
antibodies (indicated by 
arrowheads). Adjacent 
sections were immuno- 
stained using antibodies 
to Kirrel2 and PCDH1O. 
Scale bars, 100 um. 

(C) Rank order of expres- 
sion levels of Kirrel2 

and PCDH1O. The signal 
intensities of Kirrel2 and 
PCDH1O0 were quantified for 
each glomerulus and 
plotted in order (N = 718 
glomeruli for Kirrel2, N = 
673 glomeruli for PCDH10). 


Kirrel2 


\7 


M71 


ganglion cells and regulates precise circuit for- 
mation (10, 36-39). In the CNG-KO mice, spon- 
taneous activity was still present (34) (fig. S3E, 
see also movie S2) but in altered patterns; the 
event frequencies of calcium spikes and the 
inter-event intervals (IEIs) (frequency = 0.90 + 
1.02 times/min, TEI = 59.09 + 61.52 s; mean + SD, 
NN = 230 cells) were lower and longer, respec- 
tively, relative to those in wild-type mice (fre- 
quency = 2.05 + 2.48 times/min, IEI = 48.14 + 
53.09 s; mean + SD, N = 252 cells). Because CNG 
channel-mediated activity is essential for the ex- 
pression of Kirrel2, Sema7A, and PCDH1O0 (27), 
these observations indicate that specific features 
of neural activity (e.g., frequency, IEI), rather than 
the presence of neural activity, are important for 
the expression of axon-sorting molecules, leading 
to the hypothesis that neural activity is instructive 
for glomerular segregation. 


ORs determine spontaneous 
activity patterns 


We next compared the temporal features of spon- 
taneous calcium transients between OSNs expres- 
sing different OR types. We selected two types of 
OR-defined neurons based on the expression pro- 
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files of the axon-sorting molecules: (i) I7-expressing 
OSNs, in which expression levels of Kirrel2 are 
high, and (ii) MOR28-expressing OSNs, in which 
expression levels of Kirrel2 are low (Fig. 1, B and 
C, and fig. S4). We generated the transgenic mice 
H-I7-Cre-mcherry and H-MOR28-Cre-mcherry 
(hereafter abbreviated as H-OR-Cre lines), in 
which Cre recombinase was coexpressed with 
17 or MOR28 (Fig. 2C and fig. S4). Because the 
forced expression of OR from a transgene sup- 
presses endogenous OR expression (6, 40, 41), 
the monogenic OR expression is maintained in 
OSNs expressing the transgenes. The H-OR-Cre 
lines were then crossed with Ai95D mice to vis- 
ualize spontaneous calcium transients of OSNs 
expressing specific ORs. We compared the fea- 
tures of calcium spikes, including baseline fluo- 
rescence, total event number, IEI, peak amplitude, 
coefficient of variation (CV), and rise and fall 
times of calcium spikes between I7- and MOR28- 
expressing neurons (NV = 59 cells for I7, N = 61 cells 
for MOR28). We found that calcium transient 
patterns were significantly different between 
17- and MOR28-expressing OSNs (Fig. 2C), even 
though both ORs were expressed under the con- 
trol of the same MOR23 promoter (42) (Fig. 2C and 
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fig. S4A). I7-expressing OSNs exhibited calcium 
transients of higher amplitudes and higher fre- 
quencies than those of MOR28-expressing OSNs 
(Fig. 2, C and D). We conducted multidimensional 
scaling (MDS) to plot the datasets of I7- and 
MOR28-expressing OSNs according to relative dis- 
tance (Fig. 2E). In the MDS space, OSNs expres- 
sing different ORs were significantly separated 
by a single line determined by the linear support 
vector machine (SVM) classifier (gray dashed line 
in Fig. 2E, Fl = 0.775, P < 0.0001), indicating that 
different ORs are sufficient to induce differing 
patterns of spontaneous calcium transients. 

To overview a variety of calcium transient pat- 
terns in OSNs, we next collected the features of 
calcium transients from a large number of OSNs 
LN = 3214 cells; N = 59 for I7 (red), N = 61 for 
MOR28 (green), NV = 45 for M71 (blue), and N = 
3049 for randomly selected OSNs (gray)]. The 
dataset was dimension-reduced by principal com- 
ponents analysis (PCA). In two-dimensional PC 
space, OSNs were clustered depending on the 
expressed OR types (Fig. 2F and fig. $5). Taken 
together, this result indicates that OSNs expressing 
the same OR type exhibited similar temporal pro- 
files of calcium transients. 
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Fig. 2. Spontaneous activity patterns correlate with OR types. (A) An image of an OE acute slice. 
OSN calcium transients are shown at the right. Scale bar, 50 um. (B) Pairwise cross-correlation analyses 
were performed for all possible pairs (N = 2701 pairs). Top: The distribution of the correlation indices for 
the real data was not significantly different from that for shuffled data (Mann-Whitney U test). Bottom: 
Correlation indices were plotted as a function of the intercell distance between cell pairs. Data are 
means + SEM. (C) Top: H-OR-Cre lines were crossed with Ai95D mice. Bottom: Representative calcium 
transients. (D) Features of spontaneous calcium transients were analyzed for I7- or MOR28-expressing 
OSNs. Results are shown as box-and-whisker plots displaying the median and interquartile ranges. 
Outliers are 1.5 times the interquartile range (IQR) either above the third quartile or below the first quartile. 
CV, coefficient of variation. ***P < 0.001, *P < 0.05 (Mann-Whitney U test). (E) Multidimensional 

scaling (MDS) of the calcium imaging dataset of OR-defined OSNs. The support vector machine (SVM) 
classifier is shown as a gray dashed line. (F) PCA biplot of the calcium imaging dataset from 3214 OSNs. 


Spike bursts increase Kirrel2 expression 
To assess the causal links between the neural 
activity patterns and the expression of axon- 
sorting molecules, we used an optogenetic ap- 
proach to manipulate the neural activity of OSNs. 
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OSN-specific channelrhodopsin-2 (ChR2) mice 
were generated by crossing pGoofy-Cre mice 
with pCAG-LSL-ChR2 (Ai32) mice (43) (Fig. 3A, 
top). Blue light pulses (3 ms at <40 Hz) could 
precisely control the neural activity of OSNs at 
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a single spike level, irrespective of the light 
power (fig. S6). The OSN-specific ChR2 mice 
were stimulated by blue light pulses with two 
different protocols, namely phasic burst stimu- 
lation Gntraburst frequency, 10 Hz; action po- 
tentials in burst, 15; interburst interval, 15 s) and 
tonic stimulation (Fig. 3B). In both protocols, 
light-induced action potentials were maintained 
at the overall firing rate of 1 Hz because the median 
firing rate of the spontaneous activity of OSNs 
was 1.1 Hz (fig. S7). We illuminated the unilateral 
dorsolateral side of the OE with blue light pulses 
for 12 hours according to the protocols and com- 
pared the changes in the expression of axon- 
sorting molecules between the light-illuminated 
and shaded sides of the OE (Fig. 3A, bottom). We 
found that in the phasic burst protocol, Kirrel2 
was up-regulated in the light-stimulated side 
relative to the shaded side (Fig. 3B). However, 
no significant change was observed in the tonic 
protocol. Kirrel2 expression was up-regulated 
in the phasic burst protocols when the interspike 
intervals (ISIs) of the induced spikes within bursts 
were less than 200 ms (Fig. 3C). Single-cell quan- 
tification further revealed that the intraburst 
frequency of the induced spikes had different 
impacts on Kirrel2 up-regulation (ISI = 50 ms, 
d = 0.74; ISI = 100 ms, d = 1.54; ISI = 200 ms, d= 
1.09) (Fig. 3D, left). Kirrel2 expression increased 
even in MOR28-expressing OSNs, where the endog- 
enous Kirrel2 expression levels are low (Fig. 3D, 
right), indicating that the change in the expres- 
sion levels of Kirrel2 was not attributable to a 
difference in the OSN population. In addition, the 
induced spike numbers within bursts affected 
Kirrel2 expression (Fig. 3E). 

To find the relationship between endogenous 
spontaneous activity and optogenetically induced 
firing, we conducted SVM classification in terms 
of Kirrel2 expression levels. The nonlinear SVM 
classifier based on endogenous activity patterns 
of Kirrel2-high (17) and Kirrel2-low (MOR28 and 
M71) OSNs could predict the light stimulation- 
induced Kirrel2 up-regulation (Fig. 3F, left). The 
optogenetic stimulation protocols that increased 
Kirrel2 expression (Fig. 3F, a to d) were plotted 
onto the Kirrel2-high side of the graph, whereas 
those that did not change Kirrel2 expression 
(Fig. 3F, e to g) were plotted onto the Kirrel2-low 
side. Further, OSNs in the I7 cluster frequently 
exhibited calcium transients with high amplitude 
that resembled phasic burst stimulation (Fig. 3F, 
right, calcium event frequency = 0.047 + 0.031 Hz, 
calcium event amplitude = 10.25 + 4.06%, mean + 
SD). In contrast, such high-amplitude calcium 
transients were not frequently observed in the 
MOR28 and M71 clusters (MOR28, frequency = 
0.011 + 0.008 Hz, amplitude = 8.28 + 2.98%, 
mean + SD; M71, frequency = 0.036 + 0.033 Hz, 
amplitude = 4.11 + 1.17%, mean + SD). Thus, we 
deduced that the phasic burst stimulations mi- 
micked the physiological patterns of neural ac- 
tivity for Kirrel2 expression. 

In contrast to Kirrel2, the expression of Sema7A 
and PCDH10 did not change during the series of 
experiments (Fig. 3C). A similar phenotype was 
also observed in the CNG-KO background, where 
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Fig. 3. Phasic burst stimu- A B 
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Bottom: Experimental sche- 
matic for the in vivo opto- 
genetic stimulation. (B) In 
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sections using probes for 
Kirrel2 and CNG channel 
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endogenous calcium transients were more static 
(fig. S8). These results demonstrate that the spike 
bursts specifically up-regulate Kirrel2 expression. 


Prolonged burst stimulation increases 
Sema7A and PCDH1O 


We next sought to determine which pattern of 
neural activity was necessary for the expression 
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calcium event frequency (Hz) 


of Sema7A and PCDHI1O0. Because they showed 
similar protein distribution patterns at the OSN 
axon termini (27), there must be a common reg- 
ulatory mechanism underlying their expres- 
sion. We stimulated OSNs with various burst 
duration protocols ranging from 1.5 to 720 s 
(Fig. 4A). We kept the overall firing rate at 
1 Hz and the spike frequency within bursts at 


5 July 2019 


10 Hz. Optogenetic stimulation with various 
burst duration protocols revealed that Sema7A 
and PCDHI1O0 were up-regulated by burst dura- 
tions for more than 10 s and 20 s, respectively 
(Fig. 4B). It is notable that the 360-s and 720-s 
burst duration protocols increased the expres- 
sion of Sema7A and PCDHI1O, but not that of 
Kirrel2 (Fig. 4, B and C). 
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Fig. 4. Prolonged burst stimulation increases Sema7A and PCDH10. 

(A) Schematic showing various stimulation protocols. (B) Expression 
changes of axon-sorting molecules upon different light stimulation protocols. 
Quantitative analyses were performed as in Fig. 3C. Error bars denote SEM 
(N = 6 to 8 mice per group). (C) In situ hybridization of the OE sections from 
OSN-specific ChR2 mice illuminated by blue light with the long-duration 
protocol (ISI, 100 ms; burst duration per hour, 360 s). Enlarged photos are 
shown at the bottom. Scale bars, 50 um. (D) Schematic showing various 


optogenetic stimulation protocols. (E) Expression changes of axon-sorting 
molecules upon various burst units. Quantitative analyses were performed as 
in Fig. 3C. Error bars denote SEM (N = 7 or 8 mice per group). *P < 0.05, 
**P < 0.01 [one-way ANOVA with post hoc Tukey test in (B) and (E)]. 

(F) Left: Long-term calcium imaging of PCDH10-high (M71) and PCDH10-low 
(MOR28) OSNs. Right: Number of active states per hour in M71- and 
MOR28-expressing OSNs. Data are means + SD. N = 10 cells (M71), 

11 cells (MOR28). ***P < 0.001 (Student t test). 


To investigate the minimum duration of bursts 
required for the expression of Sema7A and 
PCDH1O, we further tested the effects of various 
burst units by changing the burst duration ir- 
respective of the firing rate of the induced spikes 
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(Fig. 4D). To this end, OSNs were stimulated 
hourly with various types of single burst units 
(intraburst frequency, 10 Hz; burst duration, 10 
to 360 s) for 12 hours. We found that both genes 
were up-regulated by prolonged bursts. The sensi- 
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tivities to the burst duration differed between 
Sema7A and PCDHI1O; a 20-s burst duration in- 
duced the expression of PCDH10, but not that of 
Sema7A (Fig. 4E). The differential sensitivities of 
Sema7A and PCDH1O0 to the burst durations may 
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Fig. 5. Optogenetic stimulation affects axonal segregation. (A) Schematic for generating OSN- 
specific ChR2 mosaic mice. (B) Double staining of an OE section from the ChR2 mosaic mouse. An OE 
section was subjected to in situ hybridization using an OMP probe (red) and immunostaining with anti- 
bodies to green fluorescent protein (GFP) (green). (C) Immunostaining of an OB section with antibodies 
to GFP and Kirrel2 after chronic light stimulation (intraburst frequency, 10 Hz; APs within burst, 15; 
interburst interval, 15 s). Left: Kirrel2 up-regulation upon light stimulation is indicated by arrowheads. 
Right: Quantification of protein levels of axon-sorting molecules. *P < 0.05, **P < 0.01 (one-way ANOVA 
with post hoc Tukey test). See also data S5. (D) Immunostaining of OB sections using antibodies to 
Kirrel2, GFP, and MOR28. Kirrel2-positive and -negative MOR28* axons are indicated by arrowheads and 
arrows, respectively. (E) Numbers of MOR28 glomeruli per OB in ChR2 mosaic mice with and without 
light stimulation. A glomerular structure was semiautomatically defined by immunofluorescence signals 
of vesicular glutamate transporter2 (VGlut2). Box and whiskers in all graphs represent 25th to 75th and 
10th to 90th percentiles, respectively. Scale bars, 50 um. N = 6 glomeruli, **P < 0.01 (Student t test). 


explain their partially overlapping but different 
expression patterns in the OSN axon termini in 
the OBs (27). In a series of burst duration ex- 
periments (Fig. 4D), Kirrel2 expression was not 
changed (Fig. 4E). These results indicate that 
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Sema7A and PCDH10 expression is up-regulated 
by protocols of low-frequency and long-duration 
bursts, whereas Kirrel2 expression is up-regulated 
by protocols of high-frequency and short-duration 
bursts. 
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We also performed long-term calcium imaging 
for OSNs expressing M71 and MOR28 in which 
PCDH1O0 expression levels are high and low, re- 
spectively (Fig. 1, B and C). As shown in Fig. 4F, 
both types of OSNs exhibited stationary states 
during the majority of the recorded time. How- 
ever, we observed events of prolonged calcium 
elevation several times per hour in M71-expressing 
OSNs (Fig. 4F). We defined the active state accord- 
ing to the duration when OSNs showed a higher 
amplitude of calcium transients above aver- 
age for >20 s. The active state in M71-expressing 
OSNs lasted for 34.14 + 13.47 s with an IEI of 
445.85 + 376.03 s (mean + SD), which is enough 
to induce PCDHI0 expression. In contrast, the 
active state was hardly observed in MOR28- 
expressing OSNs [0.09 + 0.30 times/hour (NV = 
11 cells) versus 2.4 + 1.90 times/hour for M7I- 
expressing OSNs (N = 10 cells), means + SD]. 
These results suggest that prolonged burst stim- 
ulations and their impact on gene expression 
mimicked physiological conditions. 


Chronic optogenetic stimulation induces 
axonal segregation 


To examine whether in vivo optogenetic stimu- 
lation indeed affects segregation of OSN axons, we 
conducted optogenetic stimulation for a prolonged 
period of time. For this purpose, we used H-Cre 
transgenic mice (13). In this mouse, Cre recom- 
binase is expressed in a subset of OSNs as a 
result of the OR-mediated negative feedback 
signal (6). Therefore, ChR2-positive and ChR2- 
negative OSNs expressing the same OR can be 
generated in the same individuals by crossing 
the H-Cre transgenic mice with the Ai32 mice 
(Fig. 5, A and B). 

Neonatal OSN-specific ChR2 mosaic mice were 
stimulated for 5 to 6 hours per day with the phasic 
burst protocol used in Fig. 3B. After 1 week of 
light stimulation, we observed an increase in 
Kirrel2 proteins, but not in PCDH10 or Sema7A 
proteins, in ChR2-positive OSN axons (Fig. 5, C 
and D). The Kirrel2 up-regulation was also ob- 
served in the CNG-KO background where en- 
dogenous Kirrel2 expression is decreased (fig. 
S9). We found segregation of MOR28-expressing 
OSN axons according to ChR2 expression in the 
OB of the ChR2 mosaic mice (Fig. 5, D and E). A 
similar axonal phenotype was also found in I7- 
expressing OSN axons (fig. S10, A and B). The 
ChR2-dependent axonal segregation did not 
occur without light stimulation (Fig. 5, D and 
E, and fig. S10). These results indicate that the 
optogenetically induced phasic burst stimulation 
induced axonal segregation, presumably through 
Kirrel2 up-regulation. 


Discussion 


Hebbian plasticity explains the activity-dependent 
development of many neural circuits (19). Synchro- 
nous firing of presynaptic neurons, such as re- 
tinal waves in the visual system (2, 36, 37, 39, 44, 45), 
is a prerequisite for the Hebbian plastic changes. 
In the present study, we did not observe spatial or 
temporal correlations for the spontaneous ac- 
tivities among OSNs (Fig. 2, A and B). Rather, 
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spontaneous activity patterns of OSNs were unique- 
ly correlated with OR types. We also showed that 
differing neural activity patterns induced the 
expression of different axon-sorting molecules 
regulating axonal segregation. On the basis of 
these results, we propose an activity-dependent 
mechanism, different from the Hebbian plastic- 
ity theory, in which specific patterns of sponta- 
neous activity determined by the expressed ORs 
contribute to generate the combinatorial code of 
axon-sorting molecules for OR-specific glomeru- 
lar segregation. 

Our calcium imaging analyses in OR substitu- 
tion experiments demonstrated that ORs de- 
termine spontaneous activity patterns. How are 
spontaneous activity patterns regulated by ORs? 
It has been reported that G protein-coupled re- 
ceptors (GPCRs), including ORs, produce unique 
levels of basal cAMP (cyclic adenosine monophos- 
phate) in a ligand-independent manner (12, 46, 47). 
A plausible explanation is that the OR-derived 
basal activity contributes to the pattern of spon- 
taneous activity. However, mutations of a GPCR 
that result in altered basal cAMP levels do not 
affect the expression of axon-sorting molecules 
in OSNs (72). Therefore, the OR-derived basal 
activity is unlikely to be a determinant of spon- 
taneous activity patterns. In this study, we found 
that individual OSNs show different firing pat- 
terns in response to continuous optogenetic 
stimulation (fig. S6D). Moreover, several voltage- 
dependent calcium channels and transient re- 
ceptor potential channels are expressed in a 
subset of OSNs (48-51). It is possible that the 
intrinsic membrane properties vary among OSNs 
expressing different OR types for OR-correlated 
patterns of spontaneous activity. However, the 
mechanisms linking the membrane properties 
to expressed ORs remain unknown. 

In this study, we demonstrated the firing 
pattern-dependent expression of axon-sorting 
molecules by optogenetic stimulation. The SVM 
classification indicated that optogenetic stimu- 
lation experiments mimicked physiological con- 
ditions (Fig. 3F and fig. S11. We found that 
phasic, repetitive short bursts of neural activity 
induced Kirrel2 expression, whereas prolonged 
durations of bursting activity induced the ex- 
pression of Sema7ZA and PCDH10. Increased levels 
of Kirrel2 were dependent on spike numbers 
and ISIs within bursts in the phasic burst stim- 
ulation (Fig. 3, D and E). The sensitivities to the 
burst duration might differ between PCDH10 
and Sema7A. We observed statistically signif- 
icant up-regulation of PCDH10 and Sema7A by 
burst durations of >20 s and >60 s, respectively 
(Fig. 4E). This firing pattern-dependent ex- 
pression allows OSNs to separately regulate the 
types and levels of axon-sorting molecules to 
generate the combinatorial code for glomerular 
segregation (fig. S12). 

How do OSNs decode the temporal patterns of 
their own neural activity into diverse expression 
of axon-sorting molecules? Previous studies have 
identified transcription factors that are activated 
by different patterns of calcium transients (52-54). 
For example, CaMKII (calcium/calmodulin- 
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dependent protein kinase II) is activated by 
high-frequency calcium oscillations, whereas 
NFATs (nuclear factors of activated T cells) are 
activated by low-frequency oscillations (55). The 
differential properties (e.g., sensitivity, affinity, 
and kinetics) of calcium-responsive factors may 
provide a molecular basis for the differential 
readouts of neural activity patterns. 

In the nervous system, neural activity is in- 
volved in various aspects of the neural de- 
velopment and plasticity—including cell type 
specification, dendritic branching, synaptic mat- 
uration, and learning and memory—through a 
complex program of gene regulation (56-59). 
Although several hundreds of genes have been 
identified as activity-dependent genes (60), their 
regulatory mechanisms and functions are not 
fully understood. In this study, we demonstra- 
ted firing pattern-dependent gene expression 
regulating neural circuit formation. With this 
strategy, neurons can generate variation through 
diversifying gene expression with only a single 
second messenger. The pattern-dependent regu- 
lation may also expand beyond development to 
the plasticity of neural circuits, which is the basis 
for learning and adapting to environmental 
changes throughout the lifetime. 


Materials and methods 
Mutant mice 


All experimental procedures were performed 
with the approval of the animal experiment ethics 
committee of the University of Tokyo and Insti- 
tutional Animal Care and Use Committee (LACUC) 
of RIKEN Kobe Branch, and in accordance with the 
guidelines for the care and use of laboratory ani- 
mals of the University of Tokyo and RIKEN Kobe 
Branch. H-Cre mice were as described (13). Ai95D, 
Ai32, M7I-ires-Cre, and CNG-KO mice were pur- 
chased from the Jackson Laboratory. pGoofy-Cre 
and Kirrel2 conditional KO (RBRC06340) mice 
were generous gifts provided by Y. Yoshihara 
and H. Sakano, respectively. To generate the H-J7 
(accession no. CDB0537T: www2.clst.riken.jp/ 
arg/TG%20mutant%20mice%20list.html) and 
H-MOR28 (accession no. CDB0536T: www2.clst. 
rikenjp/arg/TG%20mutant%20mice%20list.html) 
minigene constructs, a 10.1-kb SacI fragment con- 
taining the MOR23 gene from a BAC clone of a 
C57BL/6 mouse (RP23-306118) was subcloned into 
pBluescript II SK(+). The 2.1-kb H enhancer frag- 
ment (22) was attached to the 5’ end of the MOR23 
minigene and an ires-Cre-ires-mcherry fragment 
was inserted into the 3’-UTR of the minigene. The 
930-bp coding sequence of MOR23 was replaced 
with that of 77 or MOR28. The transgene se- 
quences were excised by AscI digestion, separa- 
ted from the vector DNA by sucrose gradient 
centrifugation, and then microinjected into the 
pronuclei of CD-1 embryos. 


Immunostaining and in situ hybridization 


Immunostaining was performed largely as 
described (22). Mice were anesthetized with 
sodium pentobarbital (2.5 mg/animal) and per- 
fused intracardially with 4% paraformaldehyde 
in phosphate-buffered saline (PBS). The olfac- 
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tory bulb (OB) was embedded briefly in an op- 
timal cutting temperature (OCT) compound 
(Tissue-Tek; Sakura Finetek) in liquid nitrogen. 
The olfactory epithelium (OE) was dissected and 
fixed overnight with 4% paraformaldehyde in 
PBS and decalcified in 0.5 M ethylenediamine- 
tetraacetic acid for 8 to 10 hours at 4°C. Tissues 
were then incubated in 30% sucrose overnight 
at 4°C and embedded in an OCT compound in 
liquid nitrogen. Serial sections (10 um for the 
OE, 12 um for the OB) were prepared with 
CM3050 cryostat (Leica) and collected onto 
Matsunami adhesive silane-coated glass slides 
(Matsunami Glass). Slides were rinsed with PBS 
three times and incubated in a blocking buffer 
[5% skim milk in PBS with 0.2% Triton X-100 
(PBST)] for 1 hour at room temperature (RT). 
After the blocking, the slides were incubated 
with primary antibodies overnight at RT. After 
being washed with PBST three times, the sec- 
tions were treated with secondary antibodies 
(Molecular Probes) at a dilution of 1:400. The 
primary antibodies used in this study are listed 
in data S7. The VectaStain ABC Kit (Vector Lab- 
oratories Inc.) was used to obtain higher in- 
tensity in the immunostaining of ORs. In situ 
hybridization was performed as described (13). 
The sequences of the RNA probes that were 
used are listed in data S7. The full-length coding 
sequence for mcherry was subcloned into the 
pGEM-T vector (Promega) and used as a template. 


Signal intensity measurement and 
statistical analysis 


Optical and fluorescent images of sections were 
obtained using a BZ-X'700 microscope (Keyence). 
A glomerular structure was defined by immuno- 
fluorescence signals of vesicular glutamate trans- 
porter2 (VGlut2), and staining intensities of 
Kirrel2, Sema7A, and PCDH10 within glomeru- 
lus were measured using ImageJ (NIH). After 
subtracting the background signals, staining 
intensities were divided by the highest staining 
intensity and were normalized to 100% (Fig. 1C). 
In Fig. 3D, signal intensities were rescaled such 
that the rescaled mean in the shaded side is 1. 
The data are presented as means + SEM. 


Optogenetic stimulation 


Mice were placed on ice to anesthetize them by 
hypothermia and a skin incision was made over 
the dorsal surface of the head. Xylocaine jelly 
(2%, AstraZeneca) was applied to the bone over- 
lying the OE as a local anesthetic, and a cover slip 
was placed over the bone to prevent the olfactory 
tissue from drying out. The olfactory sensory 
neurons (OSNs) were stimulated using 470-nm 
LEDs (Lex2; Brain Vision LLC). To stimulate the 
OE unilaterally, LEDs were placed directly out- 
side the right side of the OE, whereas the other 
side was covered with thin aluminum to avoid 
light stimulation. After 12 hours of stimulation, 
the glass cover was removed from the skull and 
olfactory tissues were collected for reverse trans- 
cription polymerase chain reaction (RT-PCR) or 
in situ hybridization. For chronic stimulation, 
pups were wrapped tightly in gauze bandage 
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before stimulation and placed in an acrylic cham- 
ber. Blue LEDs were placed directly outside 
the dorsal side of the OE. After stimulation, pups 
were placed on a temperature-controlled heating 
pad before they were returned to their mothers. 
In all protocols, the light pulse duration was 3 ms 
with a power of 3 mW mm”. The timing, dura- 
tion, and cycle of LED light stimulation were con- 
trolled with a pulse stimulator (Nihon Kohden). 


RT-PCR 


Quantitative PCR was performed as described 
(13). The expression levels of Kirrel2, Sema7A, 
and PCDH10 were normalized to that of CNG. 
Primer sets and TaqMan probes used in this study 
were designed by Universal Probe Library (Roche) 
and Nihon Gene Research Laboratories Inc. 
(Miyagi). The primers and probe sequences used 
for RT-PCR are listed in data $7. 


Slice preparation 


Acute slices were prepared from neonatal mice 
(PD3-6). Mice were anesthetized and decapitated, 
and the olfactory tissue was embedded in 5% 
low-melting agar in Ringer’s solution (125 mM 
NaCl, 2.5 mM KCl, 2 mM CaCl., 2 mM MgCl, 
25 mM NaHCOs, 10 mM HEPES, and 10 mM 
glucose) and cut into 200- to 300-m-thick slices 
using a vibratome (Leica VT1200S) in ice-cold 
oxygenated artificial cerebrospinal fluid (aCSF; 
125 mM NaCl, 2.5 mM KCl, 1 mM MgCl., 2 mM 
CaClo, 1.25 mM NaH,PO,, 25 mM NaHCOs, and 
10 mM glucose). Slices were briefly transferred 
to an interface chamber containing oxygenated 
artificial cerebrospinal fluid (aCSF) at RT. 


Ex vivo culture 


For ex vivo culture experiments, mice were an- 
esthetized by hypothermia and were quickly de- 
capitated. The nose was dissected en bloc to 
prepare intact septal nasal cartilage with epithe- 
lia. Samples were briefly transferred to an inter- 
face chamber containing oxygenated aCSF and 
were incubated at 35°C for 6 hours. After the 
incubation, OE samples were postfixed overnight 
with 4% paraformaldehyde in PBS. 


Electrophysiological recording 


Loose-patch recording was performed largely 
as described (60). Briefly, after anesthesia was 
achieved by hypothermia, mice were quickly 
decapitated, and the nose was dissected en bloc 
to prepare intact OE. The septal nasal cartilage 
with epithelia was placed in a submerged rec- 
ording chamber with the mucus layer facing up 
and was continuously perfused with oxygenated 
aCSF at 35°C. For extracellular recording, pipettes 
with resistance of 4 to 8 MO were fabricated 
using the P-1000 micropipette puller (Sutter In- 
strument) and filled with aCSF. All recordings 
were performed using the Olympus BX51W0-I 
microscope. A 40x objective lens (0.8 numerical 
aperture, LUMplan FN; Olympus) was used to 
guide a patch pipette to individual OSNs. Re- 
cordings were performed in a loose patch con- 
figuration. A loose seal (resistance, 40 to 200 MQ) 
was formed by applying gentle suction when an 
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electrode tip contacted an OSN. Signals were 
band-filtered between 200 Hz and 5 kHz. 


Calcium imaging and data analysis 


Calcium imaging was conducted using acute 
slices of OE from mice expressing GCaMP6f. OE 
slices were placed in a recording chamber per- 
fused with aCSF at 3 to 4 ml/s. Fluorophores 
were excited at 488 nm and visualized using a 
507-nm long-pass emission filter. Videos were 
recorded at 10 frames/s for 10 min or 5 frames/s 
for 1 hour using a 40x water immersion objective 
lens (0.8 numerical aperture, CFI Apo 40XW 
NIR; Nikon), a Nipkow-disk confocal microscope 
(CSU-X1; Yokogawa Electric), and a cooled elec- 
tron multiplying charge-coupled device camera 
(ixon DU897; Andor Technologies). Image ac- 
quisition was controlled by Andor Solis (Andor 
Technologies). The fluorescence change was mea- 
sured as (F, - Fo)/Fo, where F, is the fluorescence 
intensity at a given time point and Fy is the 
baseline. Automated quantification of the fea- 
tures of calcium transients was performed using 
FluoroSNNAP software (67). Principal compo- 
nents analyses were performed with Origin Pro 
(OriginLab). Multidimensional scaling (MDS) 
and support vector machine (SVM) classifica- 
tion were performed with MATLAB software. 
FI score was calculated as a measure of the ac- 
curacy in the SVM classification (62). 


Correlation analysis 


To quantify the correlation between pairs of re- 
corded cells, the correlation index (CI) was cal- 
culated as described (33, 40). The formula was as 
follows: 


cl= Nap—at,+at)T 
Navo,r)Nz0,r)2At 


where N4x-at,+ae is the number of spikes from 
cell B that fall within +A¢ of any spike from A, T 
is the duration of the recording in seconds, and 
Naco,r) and Nzgo,r) are the total numbers of spikes 
detected from cells A and B during the recording. 
To evaluate the significance of deviations from 
the “random” baseline of the CI, surrogate data 
were generated through the random shuffling of 
real data. The spike coincidence time window in 
Fig. 2 was +At = 200 ms. 
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INTRODUCTION: Innate immunity relies on 
several families of pattern recognition molecules 
(PRMs) that broadly recognize microbial motifs 
and danger signals. Upon activation, multiple 
PRMs assemble into very large protein com- 
plexes or “signalosomes,” and some PRM adap- 
tors, such as RIP2, MAVS, TRIF, and ASC, have 
even been shown to form amyloid-like fila- 
ments. The assembly of very large molecular 
structures needs to be tightly regulated to 
avoid accumulation of potential toxic pro- 
tein aggregates, such as those causing neuro- 
degeneration. However, the means by which 
PRM signalosome assembly is controlled re- 
mains poorly understood. 
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RATIONALE: The integrated stress response 
(ISR) is a highly conserved pathway that trig- 
gers eIF2a phosphorylation, a key checkpoint 
in the control of cellular responses to various 
stresses. While studying the impact of the ISR 
on innate immunity, we discovered that heme- 
regulated inhibitor (HRI), one of the four known 
elF2a kinases, was essential for pro-inflammatory 
cytokine responses to intracellular bacterial 
pathogens. We sought to determine the under- 
lying mechanism. 


RESULTS: We found that HRI was critical for 
signaling downstream of NOD1 and NOD2, two 
intracellular PRMs of the nucleotide-binding 
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domain leucine-rich repeat (NLR) protein sup- 
erfamily that detect bacterial peptidoglycan 
and induce pro-inflammatory NF-«B signaling. 
Because HRI function had been previously 
associated with proteotoxicity, we speculated 
that it might control the assembly of NOD sig- 
nalosomes. We indeed observed that HRI, 
together with the heat shock protein HSPB8, 
was necessary for the folding and release from 
endomembranes of NOD signalosomes after 
peptidoglycan stimulation. Presynthesized 
HSPB8 was released from HRI and was rap- 
idly recruited to the PRM complex. Conco- 
mitantly, HRI was activated, triggering a 

pathway dependent on 
elF2a, ATF4, and ATF3, 
Read the full article Which resulted in tran- 
at http://dx.doi. scriptional up-regulation 
org/10.1126/ of HSPB8. The HRI/eIF2a/ 
science.aaw4144 ATF4/HSPBS signaling axis 
is thus critical for control- 
ling the scaffolding of NOD signalosome and 
the sustained activation of NF-«B signaling. 
We further demonstrated that the HRI/eIF2o0 
signaling axis was also essential for signaling 
downstream of MAVS and TRIF but dispens- 
able for pathways dependent on MyD88 or 
STING; whether HSPB8 (or another HSP not 
yet identified) is involved in the HRI-dependent 
control of these multiple PRM pathways re- 
mains to be characterized. We further noticed 
that the PRM pathways regulated by HRI share 
the property that their adaptor proteins can 
form amyloid-like filaments in vitro. Indeed, 
overexpression of these proteins activated HRI, 
which suggests that potentially toxic molecu- 
lar superstructures, such as self-assembling 
amyloid-like filaments, may be direct activa- 
tors of the HRI signaling axis. In agreement 
with these findings, expression of o-synuclein, 
a protein that forms toxic amyloid filaments 
that are a pathological hallmark of Parkinson’s 
disease, induced ATF3 and HSPB8 expression 
through HRI. 


CONCLUSION: The HRI/eIF20/HSPBS8 signal- 
ing axis identified here shares a remarkable 
homology with the unfolded protein response 
(UPR), which regulates protein folding in the 
endoplasmic reticulum. We propose that HRI, 
eIF2a, and HSPB8 define a novel cytosolic 
UPR (cUPR) essential for optimal innate im- 
mune signaling by large molecular platforms. 
Future studies should be aimed at delineating 
the role of the cUPR in the control and clear- 
ance of pro-aggregative proteins, such as those 
implicated in neurodegeneration. 
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Multiple cytosolic innate sensors form large signalosomes after activation, but this 
assembly needs to be tightly regulated to avoid accumulation of misfolded aggregates. 
We found that the elF2a kinase heme-regulated inhibitor (HRI) controls NOD1 signalosome 
folding and activation through a process requiring eukaryotic initiation factor 2a (elF2a), 
the transcription factor ATF4, and the heat shock protein HSPB8. The HRI/elF2a signaling 
axis was also essential for signaling downstream of the innate immune mediators NOD2, 
MAVS, and TRIF but dispensable for pathways dependent on MyD88 or STING. Moreover, 
filament-forming a-synuclein activated HRI-dependent responses, which suggests that the 
HRI pathway may restrict toxic oligomer formation. We propose that HRI, elF2a, and 
HSPB8 define a novel cytosolic unfolded protein response (cUPR) essential for optimal 
innate immune signaling by large molecular platforms, functionally homologous to the 
PERK/elF2a/HSPAS axis of the endoplasmic reticulum UPR. 


he integrated stress response (ISR) is an 
evolutionarily conserved signaling cascade 
through which cells can cope with their 
environment and adjust their metabolic 


stress (3); and HRI responds to a broad range of 
stresses, including heme deprivation, oxidative 
stress, and heat shock (4-7), although cytosolic 
proteotoxicity may represent an overarching trig- 


status. Detection of various cellular stresses 
by the ISR is performed by four kinases: PKR 
(protein kinase R), GCN2 (general control non- 
derepressible 2), PERK (protein kinase RNA-like 
ER kinase), and HRI (heme-regulated inhibitor). 
PKR detects viral double-stranded RNA (dsRNA); 
GCN2 senses uncharged tRNAs that accumulate 
in cells lacking free amino acids (AA) (J, 2); PERK 
detects accumulation of misfolded proteins in the 
endoplasmic reticulum (ER), also known as ER 
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ger for HRI (8). These four kinases phosphorylate 
elF2a (eukaryotic initiation factor 2a) on Ser”, 
which results in a temporary block in translation 
and transcriptional reprograming to cope with 
the encountered stress. 


The HRI/elF2a axis is required 
for inflammatory responses 
during infection 


To characterize the role of the ISR in host re- 
sponses to bacterial pathogens, we infected mouse 
embryonic fibroblasts (MEFs), expressing either 
the wild-type form of eIF2a or a knock-in (KI) 
mutation encoding the Ser®! > Ala (S51A) sub- 
stitution, with Salmonella, Shigella, and Listeria, 
which induce eIF2a phosphorylation (9-7). In 
line with the established role of eIF2a phospho- 
rylation in stress granule formation, infection 
with the three pathogens induced stress granule 
accumulation in wild-type but not KI MEFs 
(fig. S1, A to C). However, we surprisingly noticed 
that the secretion of pro-inflammatory cytokines 
(Fig. 1A and fig. S1, D to H) was blunted in KI 
MEFs relative to wild-type cells in response to 
the three pathogens in a manner independent of 
bacterial replication (fig. S1, I to K). The reduced 
secretion of Cxcll in the KI MEFs correlated with 
reduced expression of Cxcll (Fig. 1B), showing 


5 July 2019 


that the ISR is required for transcriptional cyto- 
kine responses to infection. Shigella infection in- 
duced the expression of ATF3 and ATF4 (fig. S2), 
and as expected, up-regulation of the ISR-associated 
gene AZf3 was dependent on eIF2a phosphoryl- 
ation and ATF4, an essential transcription factor 
that, upon eIF2o phosphorylation, mediates trans- 
criptional reprogramming during the ISR (72) 
(fig. S3). 

In agreement with results obtained in KI cells, 
Cexcll expression after Shigella infection was 
blunted in.Atf#/~ MEFs (Fig. 1C). Down-regulation 
of Shigella-induced cytokine expression in ISR- 
deficient cells was not a consequence of exacer- 
bated translation, as reduced translation was 
observed in Atf4#/" MEFs (fig. S4). Finally, the 
impact of acute inhibition of the ISR was analyzed 
by using ISRIB, a specific inhibitor of eIF2a phos- 
phorylation. ISRIB treatment potently inhibited 
IL8, ATF3, and GADD34 expression in Shigella- 
infected cells at doses that did not alter transla- 
tion (fig. $5). Thus, phosphorylation of eIF2a is 
essential for pro-inflammatory and stress re- 
sponses during bacterial infection. 

We next aimed to identify the eIF2a kinases 
that mediate the ISR during bacterial infection. 
Our previous work has shown that invasive bac- 
terial pathogens trigger GCN2 activation (9, 10), 
but the contribution of the other eIF2a kinases 
remains undetermined. Upon infection of HeLa 
cells with Shigella, we observed the accumula- 
tion of a slower-migrating form of HRI, indica- 
tive of phosphorylation and activation, and this 
effect increased at later times of infection (Fig. 1D). 
As a control, treatment with arsenite, a known 
HRI inducer, resulted in full conversion of HRI 
to its slower-migrating form (Fig. 1D). HRI acti- 
vation is also associated with its dissociation from 
Hsc70 (13); both Shigella infection and arsenite 
treatment induced HRI/Hsc70 dissociation (fig. 
S6). We next assessed the relative contributions 
of HRI and GCN2 in triggering the ISR during 
infection by knocking down their expression (fig. 
87, A and B). Although the formation of stress 
granules after arsenite treatment or AA starva- 
tion depended on HRI and GCN2, respectively 
(figs. S7C and S8), we observed a partial contrib- 
ution of each kinase to Shigella-induced stress 
granule formation and eIF2a phosphorylation 
(fig. S7, D and E). Transient knockdown of both 
HRI and GCN2 [double knockdown (DKD)] re- 
sulted in near-complete abolishment of stress gran- 
ule formation and eIF2a phosphorylation after 
Shigella infection, relative to scrambled cells (Fig. 
1E and fig. S7D), thus showing that HRI and GCN2 
cooperate to trigger the ISR during Shigella 
infection. 

We next analyzed the relative contributions of 
HRI and GCN2 to the transcriptional responses 
induced by Shigella. Whereas ATF3 expression 
after arsenite treatment or AA starvation was 
dependent on HRI and GCN2Q, respectively (fig. 
S7F), ATF3 induction during Shigella infection 
was dependent on both HRI and GCN2 (Fig. 1F). 
Moreover, the oxidative stress response gene 
HIMOX] was induced in an HRI-dependent man- 
ner, whereas the AA metabolism gene ASNS was 
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Fig. 1. The HRI/elF2a axis is required for inflammatory responses dur- 
ing infection. (A) Cxcll secretion from the supernatants of wild-type (WT) 
and elF2a S51A knock-in (KI) MEFs after infection with Shigella for the 
indicated times, as measured by ELISA. (B and C) Expression of Cxci/l1 in 
WT and KI MEFs (B) and WT and Atf4 knockout (Atf4-’~) MEFs (C) left 
unstimulated (CTR) or after infection with Shigella. (D) HeLa cells infected 
with Shigella or treated for 1 hour with 300 uM sodium arsenite (Ars) 
and analyzed by immunoblotting. (E) HeLa cells transduced with lentiviral 
particles targeting a scrambled sequence (SC), both HRI and GCN2 


[double knockdown (DKD)] left untreated (U), infected 1 hour with 
Shigella (S.f), treated 1 hour with 300 uM Ars or treated with Krebs-Ringer 
buffer (KRB) for 45 min and analyzed by immunoblotting. (F and G) 
Expression of ATF3 (F) and /L8 (G) in HeLa cells transduced with lentiviral 
particles targeting a scrambled sequence (SC), HRI (ShHRI), or GCN2 
(shGCN2) and infected with Shigella. (H and I) Expression of CXCL1 (H) 
and /L1a (1) in scrambled, HRI KD, and GCN2 KD HeLa cells infected 

with Shigella. Data are means + SD from three independent experiments. 
*P°= 0:05, **P'< 0.01, 7 P< 0.001. 


induced in a GCN2-dependent manner in Shigella- 
infected cells (fig. S7, G to J). During infection 
with Shigella, expression of IL-8, CXCL1, and 
IL-1a (Fig. 1, G to I) was strongly repressed in HRI 
knockdown (KD) cells, whereas knockdown of 
GCN2 had no effect. Down-regulation of Shigella- 
induced cytokine expression in HRI KD cells was 
not a consequence of exacerbated translation, 
as reduced translation was observed relative to 
scramble control and GCN2 KD cells (fig. S9). 
Thus, although both HRI and GCN2 are needed 
for the full ISR during Shigella infection, HRI is 
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the sole eIF2o kinase controlling ISR-dependent 
pro-inflammatory responses to this pathogen 
(fig. S10). 


HRI is essential for NOD1- and 
NOD2-driven inflammatory responses 

In nonmyeloid cells, the NOD1/2-NF-«B axis is 
critical for inflammatory signaling in response 
to Shigella (14-16). After Shigella infection, NF-kB 
activation was blunted in HRI KD cells (Fig. 2A 
and fig. S11A). The decreased NF-«B response 
was not due to a lack of bacterial invasion or 


2019 


replication (fig. S11B) and was independent from 
translation regulation, because p65 nuclear trans- 
location after Shigella infection occurred prior 
to translation arrest (fig. S12). In agreement with 
this finding, the expression of Cxcll and J16 (Fig. 
2, B and C) was markedly reduced in Shigella- 
infected Hri/~ MEFs relative to Hri*/* cells. Knock- 
down of HRI in HCT116 cells significantly blunted 
IL-8 expression in response to the NOD1 and 
NOD? ligands (Fig. 2D), which suggests that HRI 
directly controls NOD1/2-dependent signaling. 
As a control, we noted that the expression of 
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with Shigella. (D) Expression of /L-8 in scrambled and 
HCT116 KD cells after 4 hours of stimulation with 
L18-MDP (10 ng/ml) or C12-iE-DAP (10 pg/ml). 

(E) Cxcll secretion from the supernatants of HRI*”* 


and HRI-’~ bone marrow-derived macrophages (BMDMs) stimulated for 6 hours with L1I8-MDP (10 ng/ml) or C12-iE-DAP (10 ug/ml), as measured 

by ELISA. (F) Expression of Cxcll in intestinal organoids derived from HRI*’* and HRI-’~ mice treated for 4 hours with Shigella conditioned medium 
(CM Shigella). (G) Serum Cxcll in HRI*’*, HRI heterozygous (HRI*’~), and HRI-”~ mice after intraperitoneal injection with MDP, as measured by ELISA. 
(H and 1) Cxcll (H) and CCL2 (I) in the cecum homogenate of HRI*’* and HRI-’~ mice 4 days after infection with Citrobacter rodentium, as measured 
by ELISA. Graphs represent means + SD from three to five independent experiments. *P < 0.05, **P < 0.01, ***P < 0.001 (two-way ANOVA followed 


by Tukey multiple-comparisons test). 


NOD1 and NOD2 was unaffected by HRI or GCN2 
knockdown (fig. S13, A and B). Cxcll secretion 
was also reduced in Hri’~ bone marrow-derived 
macrophages (BMDMs) stimulated with NOD 
ligands either directly (Fig. 2E) or in synergy with 
lipopolysaccharide (LPS) stimulation (fig. S13C). 
Similarly, Cxvcll expression was reduced in Hri pie 
organoids stimulated with Shigella culture super- 
natant (Fig. 2F). Intraperitoneal injection of the 
NOD2 ligand muramyl dipeptide (MDP) triggered 
a blunted secretion of Cxcll in the serum of Hri7/~ 
mice relative to Hri‘’* littermates (Fig. 2G), a re- 
duced infiltration of neutrophils into the perito- 
neum (fig. S13, D and E), and a smaller release 
of interleukin (IL)-1f into the peritoneal cavity 
(fig. SI3F). Similar effects were noted when Hri al. 
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Hri‘’”, or Hri*’* mice were injected with the NOD1 
agonist FK-156 (fig. S13, G and H). To characterize 
the role played by HRI in an in vivo model of in- 
fection driven partly by NOD1 and NOD2 (i7, 18), 
we infected Hri’/~ and Hri*/* mice with Citrobacter 
rodentium. Significantly less cecal Cxcl1, Ccl2, and 
serum Cxcll (Fig. 2, H and I, and fig. S13D, as well 
as a trend for reduced serum Ccl2 and cecal [16 
(fig. S13, J and K), were observed in C. rodentium- 
infected Hri’~ mice relative to Hri*’* littermates, 
in line with studies demonstrating reduced serum 
IL-6 in Hri~ mice during Listeria infection (19). 
This blunted pro-inflammatory response was not 
due to differences in colonization, because both 
groups of mice displayed a similar bacterial burden 
(fig. S13L). Together, these results demonstrate a 
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key role for HRI in controlling NOD1/2-driven 
pro-inflammatory responses in cell lines, in pri- 
mary cells, and in vivo. 


The HRI signaling axis controls the 
folding and solubility of NOD1 oligomers 


We speculated that HRI might control NOD1/2 
signaling at the level of complex formation. Scram- 
ble control or shHRI (short hairpin HRI) cells 
transfected with NOD1-HA (NOD1 fused to 
hemagglutinin) and either unstimulated or stimu- 
lated with C12-iE-DAP were lysed in non-ionic 
(NP-40) or ionic [radioimmunoprecipitation assay 
(RIPA)] buffers. Stimulation with C12-iE-DAP 
resulted in the release of NOD1 into the soluble 
fraction of NP-40-lysed cells, and analysis by blue 
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native polyacrylamide gel electrophoresis (PAGE) 
revealed that the released NOD1 was in the form 
of large oligomers (Fig. 3A, left panels). Interest- 
ingly, although NOD1 was expressed normally in 
shHRI cells, stimulation with C12-iE-DAP failed 
to release the protein into the soluble fraction, 
as it remained mainly insoluble (Fig. 3A, left 
panels). To determine whether NOD1 proteins 
did not form oligomers at all upon stimulation or 
whether those oligomers did form but could 
not be solubilized by NP-40, we performed a 
similar analysis in cells lysed with RIPA, which 
strips most non-transmembrane proteins from 
membranes. Using this buffer, NOD1 complexes 
could now be solubilized in shHRI cells, and blue 
native PAGE analysis revealed that they were 
constitutively found as large complexes (Fig. 3A, 
right panels). Thus, NOD1 forms large signaling 
platforms, or signalosomes, that are soluble in 
both ionic and non-ionic buffers in control cells, 
but these complexes can only be extracted by 
an ionic buffer in shHRI cells. 

To gain better understanding of NOD1 com- 
plex formation in HRI-silenced cells, we trans- 
fected scramble control and shHRI cells with 
NODI-HA and stimulated them with C12-iE-DAP, 
as above, or left them unstimulated (Fig. 3A). 
These cells were then lysed in RIPA buffer, and 
NODI1 complexes were immunoprecipitated and 
incubated with the dye SYPRO Orange, whose 
fluorescence increases when the dye binds hy- 
drophobic patches or misfolded proteins. To con- 
trol for the robustness of the assay, we denatured 
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the NOD1 immunocomplexes and an unrelated 
purified protein by boiling, resulting in increased 
fluorescence (Fig. 3B and fig. S14, A and B). 
Whereas NOD1 oligomers displayed similar fluo- 
rescence in scramble versus shHRI cells in con- 
trol conditions, stimulation with C12-iE-DAP 
resulted in a significant increase in fluorescence 
only in shHRI cells (Fig. 3C and fig. $14C), which 
suggests improper folding of the NOD1 com- 
plexes assembled after ligand stimulation in 
shHRI cells. In agreement with this finding, 
acute inhibition of elF2a-dependent signaling 
with ISRIB also increased the hydrophobicity of 
NOD1 complexes in cells stimulated with C12-iE- 
DAP (Fig. 3D). Thus, the HRI/eIF2o signaling axis 
controls the assembly of the NOD1 signalosomes 
after ligand stimulation. 


The HRI-elF2c-HSPB8 loop is essential 
for NOD1 and NOD2 signaling 


We next aimed to determine how HRI controls 
NOD signalosome assembly. Heat shock protein 
(HSP) chaperones regulate the assembly and stab- 
ility of protein complexes, so we speculated that 
HRI might regulate NOD signalosome assembly 
through the expression and action of HSPs. In our 
earlier microarray analysis of Shigella-induced 
genes (9), HSPB8 was among the most up- 
regulated genes and was the only HSP regulated 
by Shigella (fig. S15, A and B), an observation 
that we confirmed in wild-type MEFs (Fig. 4A). 
Hspb8 was also up-regulated by the unfolded 
protein response (UPR)/ER stress inducer thap- 
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Fig. 3. The HRI signaling axis controls the 
folding and solubility of NOD1 oligomers. 
(A) Scrambled and HRI KD HEK293T cells 
were stimulated for 30 min with C12-iE-DAP 
(10 ng/ml) and analyzed by blue native PAGE 
and SDS-PAGE. (B) HEK293T cells were 
stimulated with C12-iE-DAP (10 ng/ml) for 

30 min, and NOD1-HA immunocomplexes were 
subjected to SYPRO Orange hydrophobicity 
measurements. RFU, relative fluorescence units. 
(C) Scrambled or HRI KD HEK293T cells were 
stimulated for 30 min with C12-iE-DAP 

(10 ng/ml) and NODI1-HA immunocomplexes 
were analyzed as in (B). (D) HEK293T cells 
with or without ISRIB treatment (1 ug/ml) 


were stimulated with C12-iE-DAP (10 ng/ml) and NOD1-HA immunocomplexes were analyzed as 
in (B). Graphs represent means + SD from three or four independent experiments. *P < 0.05, 


**P < 0.01; ns, not significant. 
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sigargin (Fig. 4A), whereas the UPR/ER stress 
chaperone Hspa5 (also called BiP or GRP78) 
was induced by thapsigargin but not Shigella 
infection (fig. SI6A). Thapsigargin also induced 
HSPB8 and HSPA5 expression in HCT116 cells 
(fig. S16, B and C) and primary murine intestinal 
organoids (fig. S16, D and E). Thus, both HSPB8 
and HSPA5 are induced by UPR/ER stress, but 
HSPB8 is additionally up-regulated by infec- 
tion. We also analyzed the kinetics of HSPB8 
induction by Shigella and observed that whereas 
the pro-inflammatory genes JL8 and CXCL1, 
as well as the stress-induced genes ATF3 and 
GADD34, were induced in a first wave (as early 
as 1 hour after infection), HSPB8 was induced 
in a second wave, at 2 to 4 hours after infection 
(fig. S16, F to J). 

To further characterize the pathway that con- 
trols HSPB8 expression, we infected Atf#/~ MEFs 
and CRISPR-engineered ATF3/~ HCT1I6 cells 
(fig. S17A) with Shigella or stimulated them with 
thapsigargin. Shigella-induced up-regulation of 
HSPB8 was ATF4-dependent (Fig. 4B) as well as 
ATF3-dependent (fig. S17B), in agreement with 
kinetics data showing that ATF3 is induced before 
HSPB8 (fig. S16, F to J), whereas induction by 
thapsigargin required neither ATF4 nor ATF3 
(Fig. 4B and fig. S17B). In contrast, HSPA5 up- 
regulation by thapsigargin stimulation was par- 
tially dependent on ATF4 and did not require 
ATF3 (fig. S17, C and D), in agreement with the 
known reliance of HSPA5 expression on the IRElo. 
and ATF6 branches of the UPR (20, 27). More- 
over, Hspb8 up-regulation by Shigella infection 
was abrogated in eIF2a KI MEFs (fig. S17E), al- 
though baseline levels of Hspb8 mRNA and pro- 
tein (fig. S17F) were higher in KI as compared to 
wild-type MEFs in uninfected cells, possibly as a 
result of higher baseline levels of Ai3 (see fig. S3A). 
Finally, HRI but not GCN2 controlled HSPB8 up- 
regulation after Shigella infection (Fig. 4C); this 
effect was specific, as HSP90 expression was un- 
affected by HRI knockdown (fig. S17G). 

At baseline, protein levels of HSPB8 were lower 
in Shigella-infected shHRI cells than in scramble 
control or shGCN2 cells (fig. S17H). Moreover, by 
30 min after Shigella infection, a decline of the 
protein HSPB8, but not HSP90 or Hsc70, was 
observed (fig. S17H), which suggests that HSPB8 
chaperoning activity was engaged rapidly during 
infection and may result in its degradation. It is 
thus possible that the HRI-dependent transcrip- 
tional up-regulation of HSPB8 during infection 
represents a regulatory feedback loop to restore 
HSPB8 levels. Together, these results identify 
HSPB8 as a key HSP induced by Shigella infec- 
tion through the HRI/eIF20/ATF4/ATF3 signal- 
ing axis. 

We then asked whether HSPB8 mediated the 
effects of HRI on NOD-dependent signaling. 
Short hairpin RNA (shRNA)-mediated silencing 
of HSPBS8 (fig. S171) strongly inhibited Shigella- 
induced /L-8 up-regulation (Fig. 4D), whereas 
overexpression of HSPB8 in HRI KD cells or in 
HRI CRISPR knockout cells rescued Shigella- 
induced NF-«B signaling and inflammatory re- 
sponses (Fig. 4E and figs. $18 and S19). Thus, the 
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or HRI KD HEK293T cells transfected with NOD1-HA with or without 


HSPB8-V5 and analyzed as in Fig. 3A. (land J) Scrambled or HRI KD 
HEK293T cells transfected with or without HSPB8-V5 (I) and scrambled or HSPB8 KD HEK293T cells (J) were stimulated for 30 min with C12-iE-DAP 

(10 ng/ml) and NOD1-HA complexes were subjected to SYPRO Orange hydrophobicity assays. Graphs display means + SD from three to five independent 
experiments. *P < 0.05, **P < 0.01, ***P < 0.001. Asterisk in (F) denotes a nonspecific band. 


HRI signaling axis regulates Shigella-induced 
pro-inflammatory NF-«B signaling through an 
HSPB8-dependent process. 

We next aimed to uncover the mechanism 
through which the HRI-HSPB8 pathway regula- 
tes Shigella-induced pro-inflammatory NF-«B sig- 
naling. Drawing parallels with the UPR pathway 
in which activation occurs after displacement 
of chaperones to the proteins requiring refolding 
(22), we speculated that HRI might interact with 
HSPB8. Coimmunoprecipitation assays confirmed 
that HSPB8 interacted with HRI oligomers at 
the endogenous level, and the proteins rapidly 
dissociated after infection with Shigella (Fig. 4F). 
Concomitantly, an interaction between NOD1 im- 
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munocomplexes and HSPB8-V5 (HSPB8 fused 
to a V5 tag) was observed, which peaked at 10 min 
after infection (Fig. 4G). Thus, displacement of 
HSPB8 from HRI to forming NODI1 signalosomes 
may act as a signal for HRI activation. Notably, 
HSPB8 also interacted with NOD2 complexes 
and NOD2-HSPB8 interactions increased after 
MDP stimulation (fig. S20). Furthermore, ectopic 
expression of HSPB8 was sufficient to rescue the 
release of NP-40-soluble NOD1 complexes (Fig. 
4H) and to reduce the hydrophobicity of NOD1 
immunocomplexes in C12-iE-DAP-stimulated 
HRI KD cells (Fig. 41). In agreement with these 
findings, silencing of HSPB8 increased the hydro- 
phobicity of NOD1 immunocomplexes after stim- 
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ulation with C12-iE-DAP (Fig. 4J). Thus, the HRI- 
HSPB8 axis regulates NOD1 signaling through 
the control of signalosome assembly and folding, 
and HSPB8 displacement from HRI to the NOD1 
signalosome likely serves as a signal to trigger an 
HRI-dependent eIF2a-ATF4-ATF3 pathway induc- 
ing transcriptional up-regulation of HSPBS8 for 
homeostatic regulation. 


HRI differentially regulates 
PRM signaling 


We next aimed to explore whether the HRI/eIF2a/ 
HSPB8 axis regulates other pattern recognition 
molecule (PRM) pathways. We first investigated 
whether Toll-like receptor (TLR) signaling was 
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HRI +/+ HRI -/- 


Puerto Rico/8/34 (HI1N1) influenza A virus (IAV) at 6 days after infection, as measured by flow cytometry. 
(I) Representative micrographs (magnification 40x) of H&E-stained lungs from HRI*”*, HRI*”~, and HRI"’~ mice 
infected as in (G) and (H). (J and K) Secretion of IL-18 from the supernatant of HRI*’* and HRI-’~ BMDMs 


stimulated with LPS (100 ng/ml) for 3 hours alone or subsequently stimulated with 10 uM nigericin or 5mM ATP for 45 min (J) and BMDMs primed for 4 hours with 
LPS (100 ng/ml) in Opti-MEM and stimulated with Fugene alone or transfected with poly(dA:dT) for 16 hours (K), as measured by ELISA. (L) HRI*”* and HRI-’~ 
BMDMs were stimulated for 3 hours with LPS (100 ng/ml) and stimulated with 10 uM nigericin for 45 min, then analyzed by blue native PAGE. Bar graphs represent 
means + SD from three to five independent experiments. *P < 0.05, **P < 0.01, ***P < 0.001 (one-way ANOVA followed by Tukey multiple-comparisons test). 


influenced by HRI. Stimulation of scramble con- 
trol cells or shoHRI HCT116 with the TLR5 ligand 
flagellin triggered similar expression of IL-8 (fig. 
$21A). Stimulation of Hri‘/* and Hri/- BMDMs 
with the TLR4, TLR5, and TLR2 ligands LPS, flag- 
ellin, and Pam3CSK4, respectively, induced simi- 
lar secretion of Cxcll (Fig. 5A). In agreement with 
this finding, MyD88 signaling was similar in LPS- 
stimulated Hri*’* and Hri’- BMDMs (Fig. 5B and 


Abdel-Nour et al., Science 365, eaaw4144 (2019) 


fig. S21, B and C). In contrast, TRIF activation 
was blunted in LPS-treated Hri/- BMDMs (Fig. 5C 
and fig. S21, B and D), which suggests that HRI 
regulates the TLR4/TRIF but not the TLR4/ 
MyD88 signaling axis. Consistently, intraperitoneal 
injection of Hri‘/*, Hri*”, or Hri’/~ mice with LPS 
triggered similar levels of serum Cxcll (fig. S21E). 
TLR3-TRIF signaling was also partially impaired 
in Hri’- BMDMs (Fig. 5D and fig. $22). Further- 
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more, Hri*/* and Hri’- BMDMs had similar ex- 
pression of Cxcli and Ifnb (Fig. 5, E and F) and 
secreted comparable amounts of interferon (IFN) 
a/B (fig. S23) after stimulation with inducers of 
the cGAS-STING pathway. In contrast, Hri/~ 
BMDMs expressed significantly less Cxcll and 
Ifnb and secreted less type I IFNs in response to 
RIG-I-MAVS triggers (Fig. 5, E and F, and fig. $23). 
In agreement, no differences in STING-driven 
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inflammation were observable in Hri*’* and Hri~ 
MEFs or BMDMs after infection with herpes sim- 
plex virus (HSV) (fig. S24). In contrast, a strong 
decrease of these markers, together with a re- 
duced formation of MAVS oligomers, was noted 
in Hri/- MEFs and BMDMs infected with the 
RIG-I-MAVS activator Sendai virus (SeV) (fig. 
$25). In agreement with a role for HRI in TRIF- 
dependent and MAVS-dependent responses, 
but not MyD88-dependent responses, we fur- 
ther noticed that acute inhibition of eIF2a phos- 
phorylation using ISRIB significantly inhibited 
116 and [fnb induced by polyinosinic-polycytidylic 
acid [poly(I:C)] addition (TLR3-TRIF trigger) or 
transfection (MAVS trigger), and that it inhib- 
ited Ifnb induced by LPS (TLR4-TRIF trigger) 
without affecting 16 (TLR4-MyD88 trigger) (fig. 
$26). In support of the role of HRI in host de- 
fense against RNA viruses, we observed a signi- 
ficant blunting of immune cell recruitment and 
reduced damage in the lungs of Hri’~ mice at 
day 6 after infection with influenza A virus 
(Fig. 5, G to I, and fig. $27). Finally, heptose-1,7- 
bisphosphate (HBP) sensing by TIFA [tumor 
necrosis factor receptor-associated factor (TRAF)- 
interacting protein with forkhead-associated do- 
main] also required HRI, as scramble control 
cells exhibited a robust induction of JZ-8 after 
stimulation with HBP-containing conditioned 
media from gmhB Neisseria meningitidis (14), 
which was severely reduced in shHRI cells (fig. S28). 

We then tested whether the NLRP3 inflamma- 
some required HRI for signaling; NLRP3 differs 
from other PRMs tested so far, in that NLRP3 
inflammasome assembly triggers rapid cell death 
and does not induce downstream transcrip- 
tional reprogramming. LPS-dependent priming 
of Hri*/* and Hri’- BMDMs, which relies on the 
TLR4/MyD88 axis, resulted in comparable up- 
regulation of NLRP3 and pro-IL-1f in the lysates 
of Hri‘/* and Hri’~ BMDMs (fig. S29A), in agree- 
ment with the results above. After LPS priming, 
stimulation with adenosine triphosphate (ATP) 
or nigericin yielded reduced release of mature 
IL-1B and cleaved caspase-1 into the supernatant 
of Hri’- BMDMs (fig. $29A). In agreement with 
this result, we observed decreased secretion of 
IL-18 (Fig. 5J) and IL-18 (fig. S29B) in Hri/ 
BMDMs. AIM2 inflammasome activity was also 
reduced in Hri/~ BMDMs (Fig. 5K and fig. S30). 
However, cell death induced by LPS transfection, 
which depends on the caspase-11 inflammasome 
(23), was unaffected in Hri/- BMDMs, whereas 
caspase-1 activation and IL-1f release, which re- 
quire NLRP3 (24), were similarly blunted (fig. S31). 
NLRP3 activation results in the assembly of a well- 
structured oligomeric inflammasome complex 
that can be resolved using native gel electropho- 
resis. Interestingly, stimulation of BMDMs with 
nigericin resulted in blunted assembly of endog- 
enous NLRP3 inflammasomes in Hri cells (Fig. 
5L). Activation of the NLRP3 inflammasome fur- 
ther licenses assembly of ASC (apoptosis-associated 
speck-like protein containing CARD) prion-like poly- 
mers (25), which form specks. The endogenous 
ASC fibers and specks were reduced in Hri/~ 
BMDMs stimulated with nigericin (fig. S32). SYPRO 
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Orange hydrophobicity measurements of NLRP3 
inflammasome complexes after nigericin stimu- 
lation revealed greater hydrophobicity of NLRP3 
inflammasomes in Hri/” BMDMs (fig. S33), sug- 
gesting improper folding. However, ISRIB treat- 
ment had no impact on NLRP3 activation (fig. 
$34), which implies that HRI deficiency might 
induce a cellular state that affects NLRP3 in- 
flammasome assembly but that the eIF2a/HSPB8 
arm of the HRI signaling axis is likely not involved 
in the assembly or stability of the NLRP3 inflam- 
masome. This is not unexpected, given that the 
NLRP3-ASC complex rapidly leads to pyroptotic 
cell death and does not induce signaling that 
modulates transcription. Together, these results 
reveal that HRI regulates NLRP3/ASC inflamma- 
some assembly and function, although the under- 
lying mechanism of how this regulation occurs 
remains to be elucidated. 


HRI controls the cellular response to 
prefibrillar PRM adaptors and a-synuclein 


We noticed that the PRM pathways regulated by 


HRI share the property that their adaptor pro- 
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Overexpression of RIP2, MAVS, and TRIF re- 
sulted in an HRI-dependent induction of JZ8 and 
HSPB8 (Fig. 6, A and B), which suggests that the 
assembly of PRM signalosome adaptors might be 
an essential step for HRI activation. In contrast, 
overexpression of MyD88, which assembles “myd- 
dosomes” with well-defined stoichiometry that, 
contrary to “TRIFosomes,” do not form amyloid- 
like fibrils (26), induced JZ8 in an HRI-independent 
manner and did not up-regulate HSPB8 (Fig. 6, A 
and B). Overexpression of ASC induced neither 
IL8 nor HSPB8 (Fig. 6, A and B), in agreement 
with our observation (see above) that NLRP3 in- 
flammasome assembly is not controlled by the 
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Fig. 6. HRI controls the cellular response to 
prefibrillar PRM adaptors and o-synuclein. 

(A and B) /L8 (A) and HSPB8 (B) expression in 
HEK293T cells transduced with lentiviral particles 
targeting a scrambled sequence (SC) and HRI 
(shHRI) and transfected with the indicated 
plasmids. (C to E) ATF3 (C), CHOP (D), and 
HSPB8 (E) expression in SC and shHRI HEK293T 
cells transfected with the indicated plasmids. 


F 


Tubulin 


MG132: _- + - + 
SC shHRI 


(F) SC and shHRI SH-SY5Y cells treated with 5 uM MG132 for 4 hours and analyzed by Western 
blotting. Graphs represent means + SD from three to five independent experiments. *P < 0.05, 


**P < 0.01, ***P < 0.001. 
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eIF2a/HSPB8 arm of HRI signaling. Moreover, 
we observed that necroptosis signaling, which in- 
volves amyloid-like fibrillar assembly of RIPK1/3 
heterodimers (27), was also regulated by HRI 
(fig. S35). In agreement with this finding, over- 
expression of a-synuclein, a protein that forms 
toxic amyloid fibrils that are a pathological hall- 
mark of Parkinson’s disease (28), induced ATF3, 
CHOP, and HSPB8 (Fig. 6, C to E) expression in 
an HRI-dependent manner. In the neuroblastoma 
cell line SH-SY5Y, HRI silencing induced accumu- 
lation of dimers of endogenous o-synuclein phos- 
phorylated at Ser'?° (Fig. 6F), which is indicative 
of misfolding (29). Together, these data suggest 
that HRI-dependent signaling may be essential 
for controlling the assembly and/or stability of 
amyloid-like fibrils in the cytosol. 


Concluding remarks 


Our data reveal that HRI controls the formation 
and folding of large PRM signalosomes in the 
cytosol, thereby affecting downstream innate im- 
mune signaling. Because HRI differentially regu- 
lates PRM signaling, the protein may be sensitive 
to specific molecular superstructures. This idea 
is supported by the finding that HRI-regulated 
PRMs rely on adaptor proteins that, at least in 
vitro, assemble filamentous structures. Consid- 
ering that HRI is a stress-sensitive kinase of the 
ISR pathway, it is tempting to speculate that po- 
tentially toxic molecular superstructures, such as 
self-assembling amyloid-like fibrils or filaments, 
are direct activators of the HRI signaling axis. If 
this were the case, HRI could be involved in de- 
tecting and restricting the accumulation of cyto- 
solic protein aggregates that are associated with 
neurodegenerative diseases, as hinted by our pre- 
liminary results with o-synuclein. Future bio- 
chemical studies should delineate the detailed 
nature of the (supra-)molecular structures that 
activate HRI. 

The HRI/eIF2a/HSPBS8 signaling axis shares a 
remarkable homology with the PERK-dependent 
branch of the UPR, which regulates protein fold- 
ing in the ER. Indeed, the eIF2o kinase PERK 
senses the accumulation of misfolded proteins 
in the ER and is activated after dissociation with 
HSPAS, and earlier studies suggest the presence 
of a similar mechanism of regulation between 
the chaperone p58 and PKR (30). This pathway 
stimulates the up-regulation of stress-associated 
genes, such as the ER chaperone HSPAS that 
encodes BiP/GRP78. We thus propose that HRI 
and HSPB8 represent functional homologs of 
PERK and BiP, respectively, and that the HRI/ 
eIF2o/HSPBS signaling axis detects certain forms 
of cytosolic UPR (cUPR). 


Materials and methods 
Reagents and stimulations 


To induce amino acid starvation, cells were rinsed 
three times with PBS and incubated in Krebs 
Ringer bicarbonate (KRB) buffer (118.5 mM 
NaCl, 4.74 mM KCl, 1.18 mM KH»PO,, 23.4 mM 
NaHCOs, 5 mM glucose, 2.6 mM CaCly, and 
1.18 mM MgSO,, adjusted to pH 7.6 by titra- 
tion with 1 N NaOH). Unless stated, chemi- 
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cals and ligands were all obtained from Sigma 
or Invivogen, respectively. 3'eGAMP or c-di-GMP 
was delivered into cells by diluting ligands in 
digitonin permeabilization buffer [5 ug/ml for 
poly(dA:dT) and poly(I:C), 2 ug/ml for dsRNA] 
as described (37). For activation of AIM2, poly 
(dA:dT) was delivered using/FuGene. Super- 
natant was collected 16 hours after stimulation 
unless specified. 


Cell lines 


HeLa, HEK293T, HCT116, MEFs, L929, and SH- 
SY5Y cells were all cultured in DMEM with 10% 
FBS and 1% penicillin and streptomycin. Primary 
MEFs were prepared as described (32). Lentiviral 
knockdown was performed using the targeting 
sequences in table S1 as described (33). Bone 
marrow-derived macrophages (BMDMs) were 
prepared from the femurs and tibia of mice by 
obtaining bone marrow and incubating cells in 
RPMI with 20% conditioned media from L929 
cells, 10% FBS, 1% penicillin and streptomycin, 
and 2 mM t-glutamine for 7 days. EIF2o. knock- 
in (KT) MEFs were obtained from R. J. Kaufman. 
ATF4 knockout MEFs were provided by R. Wek 
and were grown in DMEM with 10% FBS, 1% 
penicillin and streptomycin, and supplemented 
with 55 uM B-mercaptoethanol, nonessential 
amino acids (NEAA), and essential amino acids. 
CRISPR/CAS9 knockout cells were generated 
using the CRISPR-Cas9 system as described (34). 
THP-1 cells were grown in RPMI with 10% FBS 
supplemented with penicillin/streptomycin, NEAA, 
and 2-mercaptoethanol. 

Mice 

Hri*’” breeder mice were obtained from a pre- 
vious study (6). Mice were bred and housed un- 
der SPF conditions at the Division of Comparative 
Medicine, University of Toronto. All experiments 
were performed with 7- to 11-week-old littermate 
mice from HRI*/ crosses. All mice experiments 
were approved by the local Animal Ethics Review 
Committee. Genotyping was performed with the 
primers shown in table S2. 


Bacterial strains and in vitro 
bacterial infections 


Infections with Shigella flexneri, Salmonella 
Typhimurium, and Listeria monocytogenes were 
performed as described (9, 10). For Shigella infec- 
tions where the time is not indicated, infections 
were performed for 4 hours. For Listeria infec- 
tions of BMDMs, plates were centrifuged at 500g 
for 10 min after addition of bacteria, incubated 
at 37°C/5% CO, for 30 min, incubated with fresh 
gentamycin-containing medium (30 ug/ml), 
then left for an additional 2 hours. To measure 
bacterial replication, cells were infected as de- 
scribed above and lysed with PBS containing 
0.1% Triton; lysate was serial-diluted and plated. 
Shigella conditioned medium was produced by 
using stationary phase cultures, pelleting bac- 
teria by centrifugation (10,000g for 5 min) and 
filtering with a 0.22-um syringe filter. Neisseria 
gonorrhoeae conditioned medium was purified 
as described (35). 
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Viral infections 

Sendai virus and HSV were provided by D. Garcin 
and D. Boutolleau, respectively. Cells were incu- 
bated with Sendai virus (SeV at 40 HA/ml) or 
HSV (MOI of 5) diluted in serum-free DMEM 
and 2 hours later replaced with DMEM with 
20% calf serum and antibiotics, incubated at 
37°C. For influenza experiments, mice were intra- 
nasally (in 25 ul PBS) infected with a sublethal 
dose (50 plaque-forming units) with influenza 
A/Puerto Rico/8/34 virus provided by J. A. 
McCullers. All experiments were performed ac- 
cording to the animal research ethics board of 
McGill University. 


Reverse transcription quantitative 
polymerase chain reaction (RT-qPCR) 


Measurements of gene expression (unless other- 
wise specified) were performed by RT-qPCR. RT- 
PCR analysis was performed with SYBR green 
reagents as described (9). Murine and human 
qPCR primers used are listed in tables S3 and S4, 
respectively. For influenza infections, RNA was 
isolated using Qiazol reagent (Qiagen) and All- 
In-One RT MasterMix (abm) was used for cDNA 
synthesis. Table S5 indicates viral primers. 


Western blots 


Unless otherwise indicated, cells were lysed using 
RIPA. Lysates were centrifuged at 13,000g to 
separate out membrane fractions versus insolu- 
ble fractions and cytoplasmic proteins versus 
soluble fractions. The membrane fractions were 
resuspended in RIPA buffer with Laemmli blue 
loading buffer and boiled to obtain the insoluble 
fractions. Samples were run on acrylamide gels 
and transferred onto PVDF. Membranes were 
blocked with 5% milk in TBST and incubated with 
the indicated antibodies: mouse anti-tubulin 
(#15168, Sigma, 1/10,000 dilution), rabbit anti-eIF20 
(#9722S, NEB, 1/1000), rabbit anti-phospho-eIlF2a 
(#9721, NEB, 1/1000), rabbit anti-HRI (407-226, 
Upstate, 1/5000), anti-HRI polyclonal (6) (1/3000) 
rabbit anti-GCN2 (#ab137543, Abcam, 1/1000), mono- 
clonal anti-HA tag (#G036, Abcam, 1/1000), mo- 
noclonal anti-Flag-Tag (#F3165, Sigma, 1/1000), 
rabbit anti-V5-Tag (#13202, NEB, 1/1000), rabbit 
anti-HSPB8 (#3059, cell signaling, 1/500), rabbit 
anti-HSP90 (#ab13492, Abcam, 1/3000), rabbit anti- 
Hsc70 (#ab51052, Abcam, 1/1000), rabbit anti-ERK 
(#9102, NEB, 1/4000), mouse anti-phospho-ERK 
(#9106, NEB, 1/4000), mouse anti-IxBa (#4814, 
NEB, 1/4000), rabbit anti-IxBo. (+ 9242, NEB, 1/ 
2000), mouse anti-phospho-IkBa (#9246, NEB, 1/ 
2000), rabbit anti-GAPDH (#G9545, Sigma, 1/20,000), 
mouse anti-caspase-1(p20) (#AG-20B-0042, Adipo- 
gen, 1/4000), mouse anti-NLRP3 (##AG-20B-0014, 
Adipogen, 1/4000), goat anti-mouse IL-1B (#AF- 
401-NA, R&D Systems, 1/1000), rabbit anti-IRF3 
(#4302, NEB, 1/2000), rabbit anti-phospho-IRF3 
(Ser396) (#4947, NEB), rabbit anti-NAK/TBK1 
(EP611Y) (#ab40676, Abcam, 1/4000), rabbit anti- 
phospho-TBK1 (Ser172) (EPR2867(2)) (#ab109272, 
Abcam, 1/2000) or rabbit anti-phospho-TBK1/NAK 
(Serl172) (#5483, NEB, 1/2000), rabbit anti-MAVS 
(44983, NEB, 1/2000) and rabbit anti-MAVS 
(M-300) (#sc-68881, Santa Cruz, 1/1000), rabbit 
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anti-STING/TMEM173 (#19851-1-AP, Proteintech, 
1/2000), rabbit anti-ASC mouse specific (D2W8U) 
(#67824, NEB, 1/2000), rabbit polyclonal anti-ATF3 
(H-90) (#sc22978, Santa Cruz, 1/1000), rabbit 
anti-ATF4 (D4B8) (#11815, NEB, 1/1000), mouse 
monoclonal anti-puromycin (4G11) (#MABE342, 
Millipore, 1/5000), rabbit anti-o-synuclein phospho- 
$129 (#ab51253, Abcam, 1/3000), mouse anti- 
a-synuclein (4ab1903, Abcam, 1/3000). 


Immunofluorescence 


Preparation of samples for immunofluorescence 
assays was performed as described (7). Anti- 
bodies were used as follows: anti-Tia-1, 1:100 
(Santa Cruz); anti-p65, 1:100 (Abcam); anti-ASC, 
1:400 (NEB). 


Surface sensing of translation 
(SUnSEt) assay 


SUnSEt assays were conducted as described (11). 


Coimmunoprecipitation and SYPRO 
Orange experiments 


For coimmunoprecipitation experiments, cells 
were lysed using RIPA buffer on ice and sub- 
sequently centrifuged at 13,000g for 3 min. 
Protein G beads (Pierce) were washed twice 
with RIPA buffer and centrifuged at 2000g for 
10 min before adding lysate and the indicated 
antibodies. The immunoprecipitation reaction 
was incubated overnight at 4°C. Next, samples 
were centrifuged at 10,000g for 5 min, washed 
with RIPA buffer, resuspended in RIPA buffer 
with Laemmli blue loading buffer and boiled for 
10 min. For hydrophobicity measurements, the 
immunoprecipitated samples were washed three 
times with PBS and resuspended in PBS. Sam- 
ples were then incubated with a final concentra- 
tion of 5x SYPRO Orange and fluorescence was 
measured using a RT-qPCR machine (300 nm 
excitation/470 nm emission) between 20° and 
100°C. Hydrophobicity measurements of NOD1 
were performed by transfecting HEK293T cells 
with NODI1-HA plasmid. Each dot on the scat- 
terplots represents a data point from a unique 
experiment; m = 3 to 5 experiments were per- 
formed for each SYPRO Orange assay. 


Intraperitoneal injections 


Mice were injected intraperitoneally with 50 ug 
of MDP, 50 ug of FK-156, or 1 ug of LPS. Two 
hours after injection, mice were killed and whole 
blood was collected by cardiac puncture. To ob- 
tain the peritoneal lavage, 2 ml of PBS was in- 
jected into the peritoneal cavity, mixed briefly by 
massaging and collected. Peritoneal lavage was 
centrifuged for 10 min at 500g, the supernatant 
was used for ELISAs and the pelleted cells were 
used for flow cytometry. 


In vivo Citrobacter infections 


Citrobacter rodentium strain DBS 100 was cul- 
tured in LB medium overnight at 37°C. Bacterial 
colony-forming units (CFUs) were calculated after 
inoculation by plating on MacConkey agar. Mice 
were fasted overnight before being inoculated 
with 1 x 10° CFU bacteria in 200 ul volume by 
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oral gavage. C. rodentiwm was obtained from 
P. Sherman (HSC, Toronto). Mice were killed 
4 days after infection and the cecum and blood of 
mice collected for ELISA experiments. 


Flow cytometry 


Isolated peritoneal cells were sequentially incu- 
bated with the LIVE/DEAD Fixable Aqua Dead 
Cell Stain Kit (ThermoFisher Scientific), rat anti- 
mouse CD16/32 monoclonal antibody (clone 43, 
ThermoFisher), Alexa Fluor 700-conjugated rat 
anti-mouse Ly-6G (clone RB6-8C5, ThermoFisher), 
and APC-eFluor 780-conjugated rat anti-mouse 
CD11b (clone M1/70). For influenza infections, 
lungs were perfused with 10 ml of PBS, minced 
and digested in collagenase (150 U/ml) for 
1 hour at 37°C and 5% COs. Single-cell suspen- 
sions were obtained by crushing cells through a 
70-um filter and stained with Fixable Viability 
Dye eFluor506 (eBioscience) for 20 min in PBS 
at 4°C, washed, then stained with anti-mouse 
CD16/32 (clone 93, eBioscience) in 0.5% BSA/ 
PBS to block nonspecific binding for 5 min at 
4°C. Finally, cells were stained with BUV395- 
conjugated rat anti-mouse CD45.2 (clone 104, 
BD Biosciences), PE-Cy7-conjugated rat anti-mouse 
CD11b (clone M170, BD Biosciences), Alexa Fluor 
700-conjugated rat anti-mouse Ly6G (clone 1A8, 
BD Biosciences), and PE-conjugated rat anti- 
mouse CCR2 (clone 475301, R&D) and FITC- 
conjugated rat anti-mouse Ly6C (clone AC-21, 
BD Biosciences) in 0.5% BSA/PBS for 20 min at 
4°C. Flow cytometry analysis was performed 
on an LSRFortessa X-20 Cell Analyzer or 
LSRFortessa Cell Analyzer (influenza) (BD Bio- 
sciences) via FACSDiva acquisition software (BD 
Biosciences), and analysis performed with FlowJo 
v10.3 (FlowJo, LLC). 


Microarray analysis 


Data from a previous report (9) were repurposed. 


IFN protein quantification 


Quantification of total IFN in the supernatants 
of BMDMs was performed using B16-Blue IFN 
o/B reporter cell lines as per the manufacturer’s 
instructions (Invivogen). 


Enzyme-linked immunosorbent 
assays (ELISAs) 


ELISAs were performed according to the manu- 
facturer’s instructions. Cxcl1, IL-1B, IL-6, and mu- 
rine CCL2 ELISA kits were obtained from R&D 
Systems. Measurements of IL-1f in response to 
NLRP3 stimulation and IL-6 measurements in 
macrophages were performed with kits from 
Thermofisher Scientific. Murine IFNB ELISA 
and IL-18 ELISA kits were obtained from PBL 
Assay Science and MBL, respectively. 


Transfections 


For transient expression, transfections were per- 
formed with PEI MW25,000. HSPB8-V5 (36) and 
o-synuclein-GFP plasmids were obtained from 
Addgene. TIFA-flag was generated by S. D. Gray- 
Owen. NOD1-HA and NOD2-HA expression vec- 
tors were used in a previous study (37). 
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Luciferase 
Luciferase assays were performed as described 
and adapted to HeLa cells (37). 


Organoids 


Murine organoid cultures were obtained and 
grown as described (38). 


Histopathology 


Lungs were excised and fixed in 10% formalin for 
48 hours. Hematoxylin and eosin (H&E) staining 
was performed by the Histopathology Core of the 
RI-MUHC of McGill University and scanned up 
to 40x on the Aperio AT Turbo (Leica). 


Blue native PAGE 


Cells were lysed using either NP-40 lysis buffer to 
obtain cytoplasmic proteins and loosely membrane- 
associated proteins or RIPA to obtain all proteins. 
After lysis, experiments were performed as de- 
scribed (35). 


Semi-denaturing detergent agarose 
gel electrophoresis (SDD-AGE) 


For size estimation of MAVS aggregates, SDD- 
AGE was performed as described (25). 


Cell death assays 


Primed BMDMs [Pam3CSK,, (1 g/ml) for 4 hours] 
were transfected with LPS using FuGene. LDH 
release was measured using LDH Cytotoxicity 
Detection Kit (Clontech). Percentages were ob- 
tained by comparing to Triton-lysed cells (100% 
cell death) and subtracting untreated controls. 

Cell death was also quantified by monitoring 
propidium iodide (PI) incorporation (5 ug/ml) 
via fluorescence measurements over 24 hours on 
a microplate fluorimeter (Fluostar Omega, BMG 
Labtech). 


ASC oligomerization 


After stimulation, cells were lysed with cold PBS 
containing 0.5% Triton X-100 and centrifuged at 
6000g for 15 min at 4°C. The pellets were washed 
twice in PBS and freshly prepared disuccinimidyl 
suberate (2 mM) was added to the resuspended 
pellets incubated with mixing for 30 min. The 
pellets were collected by centrifugation and re- 
dissolved in SDS-PAGE sample buffer. 


Statistics 


For analysis of data in fig. S16, B to E, a Student t 
test was used. For multiple comparisons, data 
was analyzed using analysis of variance (ANOVA) 
with Bonferroni post-test for cellular assays and 
Tukey post-test for in vivo experiments. 
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RNA-guided DNA insertion with 
CRISPR-associated transposases 


Jonathan Strecker’”****, Alim Ladha’”***, Zachary Gardner®”**, 
Jonathan L. Schmid-Burgk””**, Kira S. Makarova’, 


Eugene V. Koonin®, Feng Zhang ’”*?** 


CRISPR-Cas nucleases are powerful tools for manipulating nucleic acids; however, 
targeted insertion of DNA remains a challenge, as it requires host cell repair machinery. 
Here we characterize a CRISPR-associated transposase from cyanobacteria Scytonema 
hofmanni (ShCAST) that consists of Tn7-like transposase subunits and the type V-K 
CRISPR effector (Cas12k). ShCAST catalyzes RNA-guided DNA transposition by 
unidirectionally inserting segments of DNA 60 to 66 base pairs downstream of the 
protospacer. ShCAST integrates DNA into targeted sites in the Escherichia coli genome 
with frequencies of up to 80% without positive selection. This work expands our 
understanding of the functional diversity of CRISPR-Cas systems and establishes a 


paradigm for precision DNA insertion. 


rokaryotic clustered regularly interspaced 
short palindromic repeats (CRISPR) and 
CRISPR-associated proteins (Cas) systems 
provide adaptive immunity against for- 
eign genetic elements via guide RNA- 
dependent DNA or RNA nuclease activity (7-3). 
CRISPR effectors, such as Cas9 and Cas12, have 
been harnessed for genome editing (4-9) and 
create targeted DNA double-strand breaks in the 
genome, which are then repaired by endogenous 
DNA damage repair pathways. Although it is 
possible to achieve precise integration of new 
DNA after Cas9 cleavage either through homol- 
ogous recombination (J0) or nonhomologous 
end-joining (11, 12), these processes are ineffi- 
cient and vary greatly depending on cell type. 
Homologous recombination repair is also tied 
to active cell division, making it unsuitable for 
postmitotic cells. Recently, an alternative ap- 
proach for making point mutations on DNA has 
been developed that relies on using dead Cas9 
(13) to recruit cytidine or adenine deaminases 
to achieve base editing of genomic DNA (/4-16). 
However, base editing is restricted to nucleotide 
substitutions, and thus efficient and targeted in- 
tegration of DNA into the genome remains a 
major challenge. 
To overcome these limitations, we sought to 
leverage self-sufficient DNA insertion mecha- 
nisms, such as transposons. We explored bio- 
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engineering approaches of CRISPR-Cas effectors 
to facilitate DNA transposition (fig. S1). Cas9 
binding to DNA generates an R-loop structure, 
exposing a substrate for enzymes that act on 
single-stranded DNA (ssDNA). By tethering nick- 
ase Cas9(D10A) to the ssDNA transposase TnpA 
from Helicobacter pylori IS608 (17, 18), we ob- 
serve targeted DNA insertions in vitro and in 
Escherichia coli that are dependent on TnpA 
transposase activity, Cas9 single guide RNA 
(sgRNA), and the presence of an insertion site 
within the ssDNA. However, the requirement 
of an ssDNA donor will necessitate continued 
development for efficient synthesis and delivery 
to cells. 

A number of CRISPR-Cas systems lacking ac- 
tive nuclease domains have been identified pre- 
viously, including minimal type I loci lacking the 
Cas3 helicase-nuclease (19) and type V loci con- 
taining a Cas12 effector with a naturally inac- 
tivated RuvC-like nuclease domain (20). The 
absence of nuclease domains raises questions 
about the biological function of these CRISPR- 
Cas systems that can bind but not cleave DNA. 
Recently, an association between Tn7-like trans- 
posons and subtype I-F, subtype I-B, or subtype 
V-K (formerly, V-U5) CRISPR-Cas systems was 
reported (21, 22). The CRISPR-Cas-associated 
Tn7-like transposons contain tnsA, tnsB, tnsC, 
and tniQ genes (2D), similar to the canonical Tn7 
heterotrimeric TnsABC complex (23, 24). Tn7 
is targeted to DNA via two alternative path- 
ways that are mediated, respectively, by TnsD, 
a sequence-specific DNA binding protein that 
recognizes the Tn7 attachment site (25, 26), and 
TnsE, which facilitates transposition into conju- 
gal plasmids and replicating DNA (27). 

The association between Tn7-like transposons 
and CRISPR-Cas systems suggests that the trans- 
posons might have hijacked CRISPR effectors to 


generate R-loops in target sites and facilitate the 
spread of transposons via plasmids and phages 
(21). In the case of subtype V-K, the position of 
the CRISPR-Cas locus is frequently conserved in 
predicted transposons, suggesting that CRISPR- 
Cas is linked with transposition (22). However, 
because canonical Tn7 transposons often carry 
cargo genes with defense functions that are 
beneficial to the host cell (24), it is also pos- 
sible that CRISPR-Cas may be cargo genes. To 
date, no functional data on transposon-encoded 
CRISPR-Cas systems have been reported. Here 
we show that Tn7-like transposons can be di- 
rected to target sites via CRISPR RNA (crRNA)- 
guided targeting and we elucidate the mechanism 
of crRNA-guided Tn7 transposition. We further 
demonstrate that Tn7 transposition can be re- 
programmed to insert DNA into the genome of 
E. coli, highlighting the potential of using RNA- 
guided Tn7-like transposons for genome editing. 


Characterization of a transposon 
associated with a type V CRISPR system 


Among the transposon-encoded CRISPR-Cas var- 
iants, the subtype V-K are the simplest because 
they contain a single-protein CRISPR-Cas effec- 
tor (20, 21, 28), Cas12k (formerly, C2c5). Subtype 
V-K systems are thus far limited to cyanobac- 
teria, and the latest nonredundant set includes 
63 loci that, in the phylogenetic tree of Cas12k, 
split into four major branches, covering a broad 
taxonomic range of Cyanobacteria (22). All V-K 
systems are embedded within predicted Tn7-like 
transposable elements with no additional cas 
genes, suggesting that, if they are active CRISPR- 
Cas systems, they might rely on adaptation mod- 
ules supplied in trans. Of the 560 analyzed V-K 
spacers, only six protospacer matches were iden- 
tified: three from cyanobacterial plasmids and 
three from single-stranded transposons of IS200 
or IS650 families (22). These findings suggest 
that V-K systems might provide a biological ad- 
vantage for the host transposons by directing 
integration into other mobile genetic elements, 
to enhance transposon mobility while minimiz- 
ing damage to the host. 

For experimental characterization, we selected 
two Tn7-like transposons encoding subtype V-K 
CRISPR-Cas systems [hereafter, CAST (CRISPR- 
associated transposase)]. The selected CAST loci 
were 20 to 25 kb in length and contained Tn7- 
like transposase genes at one end of the trans- 
poson with a CRISPR array and Cas12k on the 
other end, flanking internal cargo genes (Fig. 1A 
and fig. S2, A and B). We first cultured the cyano- 
bacteria Scytonema hofmanni (UTEX B 2349) 
(Fig. 1B) and Anabaena cylindrica (PCC 7122) 
and performed small RNA sequencing to deter- 
mine whether the CRISPR-Cas systems are ex- 
pressed and active. For both loci, we identified 
along putative trans-activating crRNA (tracrRNA) 
that mapped to the region between Cas12k and 
the CRISPR array, and in the case of S. hofmanni 
(ShCAST) we detected crRNAs 28 to 34 nucleo- 
tides (nt) long, consisting of 11 to 14 nt of direct 
repeat (DR) sequence with 17 to 20 nt of spacer 
(Fig. 1C and fig. S2C). 
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To investigate whether ShCAST and AcCAST 
function as RNA-guided transposases, we cloned 
the four CAST genes (tnsB, tnsC, tniQ, and 
Cas12k) into a helper plasmid (pHelper) along 
with the endogenous tracrRNA region and a 
crRNA targeting a synthetic protospacer (PSP1). 


We predicted the ends of the transposons by 
searching for TGTACA-like terminal repeats sur- 
rounded by a duplicated insertion site (27) and 
constructed donor plasmids (pDonor) contain- 
ing the kanamycin resistance gene flanked by the 
transposon left end (LE) and right end (RE). 


Given that CRISPR-Cas effectors require a pro- 
tospacer adjacent motif (PAM) to recognize tar- 
get DNA (29), we generated a target plasmid 
(pTarget) library containing the PSP1 sequence 
flanked by a 6N motif upstream of the protospacer. 
We coelectroporated pHelper, pDonor, and 
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Fig. 1. Targeting requirements for CRISPR-associated transposase 
(CAST) systems. (A) Schematic of the S. hofmanni CAST locus containing 
Tn7-like proteins, the CRISPR-Cas effector Cas12k, and a CRISPR array. 
(B) Fluorescent micrograph of the cyanobacteria S. hofmanni. Scale bars, 
40 um. (C) Alignment of small RNA sequencing reads from S. hofmanni. 
The location of the putative tracrRNA is marked. (D) Schematic of 
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experiment to test CAST system activity in E. coli. kanR, kanamycin 
resistance gene; F, forward primer; R, reverse primer. (E) PAM motifs for 
insertions mediated by ShCAST and AcCAST. (F) ShCAST and AcCAST 
insertion positions identified by deep sequencing. (G) Insertion frequency 
of SACAST system in E. coli with pTarget substrates as determined by 
ddPCR. Error bars represent SD from n = 3 replicates. n.t., nontargeting. 
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Fig. 2. Genetic requirements for RNA-guided insertions. (A) Genetic requirement of tnsB, 
tnsC, tniQ, Cas12k, and tracrRNA on insertion activity. Deleted components are indicated 

by a dashed outline. (B) Insertion activity of six tracrRNA variants expressed with the 
pJ23119 promoter. (C) Schematic of tracrRNA and crRNA base pairing and two sgRNA 
designs highlighting the linker sequence (blue). (D) Insertion activity into pTarget containing 
ShCAST transposon ends relative to activity into pTarget without previous insertion. Error 


bars represent SD from n = 3 replicates. 


pTarget into E. coli and extracted plasmid DNA 
after 16 hours (Fig. 1D). We detected insertions into 
the target plasmid (pInsert) by polymerase chain 
reaction (PCR) for both ShCAST and AcCAST, 
and deep sequencing confirmed the insertion of 
the LE into pTarget. Analysis of PAM sequences 
in pInsert plasmids revealed a preference for GIN 
PAMs for both ShCAST and AcCAST systems, 
suggesting that these insertions result from 
Cas12k targeting (Fig. 1E and fig. S3, A and B). 
We next examined the position of the donor in 
plInsert products relative to the protospacer. In- 
sertions were detected within a small window 60 
to 66 base pairs (bp) downstream from the PAM 
for ShCAST and 49 to 56 bp downstream from 
the PAM for AcCAST (Fig. 1F). No insertions 
were detected in the opposite orientation for 
either system, indicating that CAST functions 
unidirectionally. Although DNA insertions could 
potentially arise from genetic recombination in 
E. coli, the discovery of an associated PAM se- 
quence and the constrained position of insertions 
argues against this possibility. 

To validate these findings, we transformed 
E. coli with ShCAST pHelper and pDonor plas- 
mids along with target plasmids containing a 
GGTT PAM, an AACC PAM, and a scrambled 
nontarget sequence. We assessed insertion events 
by quantitative droplet digital PCR (ddPCR), 
which revealed insertions of the donor only in 
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the presence of pHelper and a pTarget contain- 
ing a GGTT PAM and crRNA-matching proto- 
spacer sequence (Fig. 1G). Additional experiments 
with 16 PAM sequences confirmed a preference 
for NGTN motifs (fig. S3C). As further validation, 
we recovered pInsert products and performed 
Sanger sequencing of both LE and RE junctions. 
All sequenced insertions were located 60 to 
66 bp from the PAM and contained a 5-bp dup- 
licated insertion motif flanking the inserted 
DNA (fig. S4), consistent with the staggered DNA 
breaks generated by Tn7 (30). Because Tn7 
inserts into a CCCGC motif downstream of its 
attachment site, we hypothesized that the se- 
quence within the insertion window might also 
be important for CAST function. We generated 
a second target library with an 8N motif located 
55 bp from the PAM and again cotransformed 
the library into E. coli with ShCAST pHelper and 
pDonor followed by deep sequencing (fig. S5A). 
We observed only a minor sequence preference 
upstream of the LE in pInsert, with a slight T or 
A preference three bases upstream of the inser- 
tion site (fig. S5, B to D). ShCAST can therefore 
target a wide range of DNA sequences with min- 
imal targeting rules. Together, these results in- 
dicate that AcCAST and ShCAST catalyze DNA 
insertion in a heterologous host and that these 
insertions are dependent on a targeting proto- 
spacer and a distinct PAM sequence. 


Genetic requirements for RNA-guided 
insertions 

We next sought to determine the genetic require- 
ments for ShCAST insertions in E£. coli and 
constructed a series of pHelper plasmids with de- 
letions of each element. Insertions into pTarget 
required all four CAST proteins and the tracrRNA 
region (Fig. 2A). To better characterize the tracrRNA 
sequence, we complemented pHelperatracrrna 
with various tracrRNA driven by the pJ23119 
promoter. Expression of the 216-nt tracrRNA 
variant 6 alone was sufficient to restore DNA 
transposition (Fig. 2B). The 3’ end of the tracrRNA 
is predicted to hybridize with a crRNA contain- 
ing 14 nt of the DR sequence, and we designed 
sgRNA testing two linkers between the tracrRNA 
and crRNA sequences. Both designs supported 
insertion activity in the context of the tracrRNA 
variant 6 (Fig. 2C). We observed that expression 
of tracrRNA or sgRNA with the pJ23119 promot- 
er resulted in a fivefold increase in the insertion 
activity compared with the natural locus, sug- 
gesting that RNA was rate limiting during heter- 
ologous expression. 

As ShCAST does not destroy the protospacer 
upon DNA insertion, we asked whether multiple 
insertions could occur in pTarget, or if these are 
inhibited, as with canonical Tn7 (31, 32). We gen- 
erated target plasmids containing LE + RE, or LE 
alone, and measured ShCAST transposition ac- 
tivity at six nearby protospacers. We observed a 
strong inhibitory effect on transposition at a 
protospacer 62 bp from the LE (<1% of relative 
activity to pTarget), and only 5.7% relative ac- 
tivity 542 bp from the LE (Fig. 2D), indicating 
that CAST transposon ends act in cis to prevent 
multiple insertions. The presence of LE alone 
resulted in a weaker inhibitory effect, and we 
observed 61.1% of activity at 542 bp away from 
the transposon end (fig. S6, A and B). 

Our original pDonor contained 2.2 kb of cargo 
DNA, and we next tested the effect of donor 
length on ShCAST activity ranging from 500 bp 
to 10 kb. We observed a twofold higher insertion 
rate with a 500-bp donor and a similar rate of 
insertions with 10 kb of payload compared with 
the original pDonor (fig. S6C). We were unable to 
detect rejoined pDonor backbone during trans- 
position in £. coli (fig. S6é, D and E), suggesting 
that a linear donor backbone is formed, not a 
rejoined product, consistent with the known re- 
action products of canonical Tn7 (30, 33). Finally, 
we investigated the requirement of the LE and RE 
transposon end sequences contained in pDonor 
for transposition. Removal of all flanking ge- 
nomic sequences or the 5bp duplicated target 
sites had little effect on insertion frequency, 
and ShCAST tolerated truncations of LE and RE 
to 113 and 155 bp, respectively (fig. S7A). Removal 
of additional donor sequence completely abol- 
ished transposase activity, which is consistent 
with the loss of predicted Tn7 TnsB-like binding 
motifs (fig. S7, B and C). 


In vitro reconstitution of ShCAST 


Although our data strongly suggested that SnaCAST 
mediates RNA-guided DNA insertion, to exclude 
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the requirement of additional host factors, we 
next sought to reconstitute the reaction in vitro. 
We purified all four ShCAST proteins (fig. S8A) 
and performed in vitro reactions using pDonor, 
pTarget, and purified RNA (Fig. 3A). Addition of 
all four protein components, crRNA, and tracrRNA 
resulted in DNA insertions detected by both 
LE and RE junction PCRs, as did reactions 
containing the four protein components and 
sgRNA (Fig. 3B). The truncated tracrRNA vari- 
ant 5 was also able to support DNA insertion in 
vitro, in contrast with the activity observed in 
E. coli. SuCAST-catalyzed transposition in vitro 
occurred between 37° and 50°C and depended 
on adenosine triphosphate and Mg”* (fig. S8, B 
and C). To confirm that in vitro insertions are in 
fact targeted, we performed reactions with tar- 
get plasmids containing a GGTT PAM, an AACC 
PAM, and a scrambled nontarget sequence, and 
we could only detect DNA insertions into the 
GGTT PAM substrate with the target sequence 
(Fig. 3C). In vitro DNA transposition depended 
on all four CAST proteins, although we identified 
weak but detectable insertions in the absence of 
tmiQ (Fig. 3D). 

We were unable to detect DNA cleavage in 
the presence of Cas12k and sgRNA across a range 
of buffer conditions (fig. S8D), which is consist- 
ent with the predicted lack of nuclease activity of 
Cas12k. To determine whether other CRISPR- 
Cas effectors could also stimulate DNA transpo- 
sition, we performed reactions with tnsB, tnsC, 
and tniQ, along with dCas9 and a sgRNA target- 
ing the same GGTT PAM substrate. We were 
unable to detect any insertions after dCas9 incu- 
bation (Fig. 3E), which indicates that the func- 
tion of Cas12k is not merely DNA binding, and 
that DNA transposition by CAST does not simply 
occur at R-loop structures. As final validation, we 
transformed in vitro reaction products into E. coli 
and performed Sanger sequencing to determine 
the LE and RE junctions. All sequenced donors 
were located in pTarget 60 to 66 bp from the 
PAM and contained duplicated 5-bp insertion 
sites, demonstrating complete reconstitution of 
ShCAST with purified components. 


ShCAST mediates efficient and precise 
genome insertions in E. coli 


To test whether ShCAST could be reprogrammed 
as a DNA insertion tool, we selected 48 targets 
in the E. coli genome and cotransformed pDonor 
and pHelper plasmids expressing targeting sgRNAs 
(Fig. 4A). We detected insertions by PCR at 29 of 
the 48 sites (60.4%) and selected 10 sites for 
additional validation (fig. S9A). We performed 
ddPCR to quantitate insertion frequency after 
16 hours and measured rates up to 80% at PSP4:2 
and PSP49 (Fig. 4B). This high efficiency of 
insertion was surprising given that insertion 
events were not selected for by antibiotic re- 
sistance, so we performed PCR of target sites 
to confirm. We detected the 2.5-kb insertion 
product in the transformed population (Fig. 
4C). Restreaking transformed E. coli yielded 
pure single colonies, the majority of which 
contained the targeted insertion (fig. S9B), and 
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the high efficiency of integration was main- 
tained with a variety of donor DNA lengths 
(fig. S9C). We analyzed the position of genome 
insertions by targeted deep sequencing of the 
LE and RE junctions and observed insertions 
within the 60- to 66-bp window at all 10 sites 
(Fig. 4D and fig. S10A). 

We next assayed the specificity of RNA-guided 
DNA transposition. We performed unbiased 
sequencing of donor insertion sites after Tn5 
tagmentation of genomic DNA. We observed 
one prominent insertion site in each sample, 
which mapped to the target site and contained 
>50% of the total insertion reads (Fig. 4E). The 
remaining off-target reads were scattered across 


the genome, and analysis of the top off-target 
sites revealed strong overlap between sam- 
ples, demonstrating that these events are inde- 
pendent of the guide sequence (fig. SIOB and 
table S5). Top off-target sites were located near 
highly expressed loci such as ribosomal genes, 
serine-tRNA ligase, and enolase, although in- 
sertion frequency in these regions were all <1% 
of the on-target site (table S5). We identified 
one potential RNA-guided off-target after tar- 
geting of PSP42, which contains four mis- 
matches to the guide sequence (fig. S10C). 
Together, these results indicate that SaCAST 
robustly and precisely inserts DNA into the 
target site. 
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Fig. 3. In vitro reconstitution of an RNA-guided transposase. (A) Schematic of in vitro 
transposition reactions with purified ShCAST proteins and plasmid donor and targets. 

(B) RNA requirements for in vitro transposition. plnsert was detected by PCR for LE and 

RE junctions. All reactions contained pDonor and pTarget. Schematics indicate the location 
of primers and the expected product sizes for all reactions. (C) Targeting specificity of SHCAST 
in vitro. All reactions contained ShCAST proteins and sgRNA. (D) Protein requirements for 
in vitro transposition. All reactions contained pDonor, pTarget, and sgRNA. (E) CRISPR-Cas 
effector requirements for in vitro transposition. All reactions contained SHCAST proteins, 
pDonor, and pTarget. (F) Chromatograms of plnsert reaction products after transformation 
and extraction from E. coli. LE and RE elements are highlighted and the duplicated insertion 
sites denoted. For all panels, ShCAST proteins were used at a final concentration of 50 nM, 
and n = 3 replicates for all reactions were performed with a representative image shown. 
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Discussion 

Here we demonstrate that CRISPR-Cas systems 
associated with Tn7-like transposon mediate RNA- 
guided DNA transposition and elucidate its 
mechanism. ShCAST mediates unidirectional in- 
sertions in a narrow window downstream of the 


Fig. 4. SaCAST mediates genome insertions 
in E. coli. (A) Schematic of experiment to test 
for genome insertions in E. coli. (B) Insertion 
frequency at 10 tested protospacers after 
ShCAST transformation. Insertion frequency 
was determined by ddPCR on extracted 
genomic DNA. Error bars represent SD 

from n = 3 replicates. (C) Flanking PCR 

of three tested protospacers in a population 
of E. coli after SACAST transformation. 
Schematics indicate the location of primers 
and the expected product sizes. (D) Insertion 
site position as determined by deep sequencing 
after ShHCAST transformation. (E) Insertion 
positions determined by unbiased donor 
detection. The location of each protospacer 

is annotated along with the percent of 

total donor reads that map to the target. 


Fig. 5. Model for RNA-guided DNA transposition. 


The ShCAST complex that consists of Cas12k, 
TnsB, TnsC, and TniQ mediates insertion of DNA 
60 to 66 bp downstream of the PAM. Transposon 
LE and RE sequences, along with any additional 
cargo genes, are inserted into DNA resulting 

in the duplication of 5-bp insertion sites. 
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target and inhibits repeated insertions into a single 
target site (Fig. 5). Although ShCAST and AcCAST 
exhibit similar PAM preferences, one notable 
difference is that their respective positions of 
insertion, relative to the PAM, differ by 10 to 11 bp, 
which roughly corresponds to one turn of DNA. 


Deeper exploration of microbial genomes is ex- 
pected to uncover CAST systems with a range of 
diverse properties, including targeting prefer- 
ence and activity across different conditions. 
Targeted DNA insertion by ShCAST results 
in the incorporation of LE and RE elements and 
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is therefore not a scarless integration method. 
One potential generalizable strategy for the use 
of CAST in the therapeutic context would be to 
insert corrected exons into the intron before 
the mutated exon (fig. S11). CAST could also be 
used to insert transgenes into “safe harbor” loci 
(34) or downstream of endogenous promoters so 
that the expression of transgenes of interest can 
benefit from endogenous gene regulation. 

Further studies should improve our under- 
standing of the function of each transposase 
subunit in the CAST complex, notably, TniQ, 
which contains a predicted DNA binding do- 
main. We originally hypothesized that TniQ is 
analogous to the site-specific DNA binding pro- 
tein TnsD of Tn7 and, therefore, might be dis- 
pensable for RNA-guided insertions; however, 
we observed that TniQ is required for RNA- 
guided insertions in E. coli. The observation that 
in vitro transposition can occur to a limited ex- 
tent in the absence of TniQ is compatible with a 
model in which TniQ facilitates the formation of 
the CAST complex and is not essential for cat- 
alytic function. Therefore, it might be possible 
to engineer simplified versions of CAST systems 
without TniQ or with fragments of TniQ, 

Our analysis indicated that ShCAST is specific 
but, under overexpression conditions, can inte- 
grate at nontargeted sites in the E. coli genome 
via Cas12k-independent mechanisms, and this 
guide-independent integration seems to favor 
highly expressed genes. We also observed non- 
targeted insertions into pHelper in £. coli that 
were independent of Cas12k (fig. $12) and remi- 
niscent of TnsE-mediated Tn7 insertions into con- 
jugal plasmids and replicating DNA (27). Future 
protein engineering of the transposase compo- 
nents could improve the targeting specificity of 


CAST systems. 
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This work identifies a function for CRISPR- 
Cas systems beyond adaptive immunity that 
does not require Cas nuclease activity and pro- 
vides a strategy for targeted insertion of DNA 
without engaging homologous recombination 
pathways, with a particularly exciting potential 
for genome editing in eukaryotic cells. 
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A COPII subunit acts with 
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INTRODUCTION: The unfolded protein re- 
sponse (UPR) maintains homeostasis of the 
endoplasmic reticulum (ER) through a vari- 
ety of mechanisms, including ER-associated 
degradation (ERAD). ERAD recognizes ter- 
minally misfolded or unassembled proteins 
and retrotranslocates them across the ER 
membrane into the cytosol, where they are 
degraded by the proteasome. Certain disease- 
associated, aggregation-prone proteins, how- 
ever, cannot be cleared by ERAD and are 
disposed of by other pathways. Aggregation- 
prone proteins have been linked to a variety 
of neurodegenerative diseases, including a 
diverse group of disorders characterized as 
hereditary spastic paraplegias. Thus, under- 
standing how these alternate disposal pathways 
function has important medical implications. 


RATIONALE: Autophagy of the ER (ER-phagy) 
is a disposal pathway that targets ER domains 
and sequesters them into autophagosomes for 
delivery to the vacuole or lysosome for degra- 
dation. When ER-phagy is induced, ER-phagy 
receptors recruit their binding partner, Atg8 in 
yeast or LC3 in mammals, to discrete sites on 
the ER to facilitate autophagosome formation. 
These sites are formed on a highly dynamic 


ER-phagy 


Autophagosome 


Nutrient deprivation 


Aggregation-prone 
ER protein 


W/ ER-phagy receptor 
@ Atg8/Lc3 
@ Exported cargo 


tubular ER network that is stabilized by Lnp1, 
a conserved protein that resides at the tubular 
ER junctions. Although ER-phagy occurs at 
discrete sites on the contiguous ER network, 
the receptors that load ER into autophago- 
somes are dispersed throughout the network. 
How specific sites on the ER are targeted for 
ER-phagy is unknown. We reasoned that cyto- 
solic components might function with ER- 
phagy receptors to define these sites. COPII 
coat subunits were candidates for such factors 
because they are known to segregate mem- 
brane domains from the bulk ER. Correctly 
folded proteins destined to exit the ER are 
packaged by the COPII coat cargo adaptor 
complex Sec24-Sec23 into transport vesicles 
that bud from the ER and traffic to the Golgi. 


RESULTs: In budding yeast, we discovered 
that the Sec24 paralog Lst1, which forms a 
COPII cargo adaptor complex with Sec23, is 
essential for ER-phagy. ER-phagy is typically 
induced in yeast with the drug rapamycin, 
a TOR (target of rapamycin) kinase inhibitor 
that mimics nutrient deprivation. Rapamycin 
treatment up-regulated the Atg40 ER-phagy 
receptor and increased the number of Atg40 
puncta that colocalized with Lst1-Sec23. Atg40 


Secretory pathway 


COPII cargo adaptors package cargoes onto two distinct pathways. ER-phagy, which can be 
induced by nutrient deprivation or expression of aggregation-prone proteins, occurs at ERPHS 
that contain Lstl/SEC24C-Sec23. The absence of Sec24 from the ERPHS distinguishes them from 
the Sec24-containing ERES, which bud conventional COPI| vesicles that traffic to the Golgi. ER 
junctions stabilized by Lnpl are important for the formation of the ERPHS. 
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also bound to Lst1. Other COPII coat subunits 
did not colocalize with or interact with Atg40. 
Highlighting the importance of ER dynamics 
in ER-phagy, the Lst1-Sec23 complex failed 
to colocalize with Atg40 puncta and Atg8 in 
Lnp1-depleted cells, which are defective in ER- 
phagy. Consistent with a role for Lst1 in pack- 
aging ER domains, Atg40 degradation during 
ER-phagy required Lstl. Furthermore, both 
Lst1 and Atg40 were needed to sequester ER 

domains into autophago- 
somes. We also found that 
Read the full article ER stress induced by an 
at http://dx.doi. overexpressed, aggregation- 
org/10.1126/ prone secretory protein 
science.aau9263 up-regulated Atg40 expres- 
scat acta ett ston to-téduoe proteinaz: 
gregation in the ER. TOR-dependent autophagy 
transcriptional regulators, and not the UPR, 
modulated Atg40 expression. The role of Lst1 
in ER-phagy appears to be conserved, because 
its mammalian homolog, SEC24C, was sim- 
ilarly required for ER-phagy. Specifically, we 
found that SEC24C was needed for the deg- 
radation of two mammalian ER-phagy recep- 
tors, FAM134B and RTN3. SEC24C function is 
especially critical in the central nervous system, 
because its depletion in postmitotic neurons 
leads to their death. 


CONCLUSION: We discovered an unconven- 
tional function for a COPII cargo adaptor com- 
plex, Lst1/SEC24C-Sec23, in targeting domains 
of the ER for autophagy. Lst1/SEC24C-mediated 
ER-phagy was induced by nutrient deprivation 
or the expression of aggregation-prone pro- 
teins. In the latter case, ER-phagy prevented 
protein aggregation in the ER. Formation of 
Lst1/SEC24C-Sec23-containing ER-phagy sites 
(ERPHS) required Lnp1-stabilized ER junctions. 
ERPHS were found to be distinct from the 
Sec24-Sec23-containing ER exit sites (ERES) 
that bud canonical COPII-coated transport 
vesicles. Thus, our findings indicate that COPII 
cargo adaptors package distinct cargoes (ER 
domains or exported proteins) into membranes 
that are then directed onto alternate pathways, 
ER-phagy or the secretory pathway. The Lst1/ 
SEC24C-Sec23 and Sec24-Sec23 cargo adaptor 
complexes could sort their distinct cargoes 
through an interaction with the cytoplasmic 
domain of an ER transmembrane protein: the 
ER-phagy receptor or an exported protein, re- 
spectively. These distinct ER-trafficking routes 
are regulated by separate stress-response path- 
ways. ER-phagy depends on TOR-dependent 
autophagy transcriptional regulators, whereas 
protein homeostasis on the secretory pathway 
is modulated by the UPR. 
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The COPII-cargo adaptor complex Lst1-Sec23 selectively sorts proteins into vesicles 
that bud from the endoplasmic reticulum (ER) and traffic to the Golgi. Improperly 
folded proteins are prevented from exiting the ER and are degraded. ER-phagy is an 
autophagic degradation pathway that uses ER-resident receptors. Working in yeast, 
we found an unexpected role for Lst1-Sec23 in ER-phagy that was independent from 
its function in secretion. Up-regulation of the stress-inducible ER-phagy receptor 
Atg40 induced the association of Lst1-Sec23 with Atg40 at distinct ER domains to 
package ER into autophagosomes. Lstl-mediated ER-phagy played a vital role in 
maintaining cellular homeostasis by preventing the accumulation of an aggregation- 
prone protein in the ER. Lst1 function appears to be conserved because its mammalian 
homolog, SEC24C, was also required for ER-phagy. 


esicle transport cargo adaptors sort and 

concentrate cargo into transport vesicles at 

specific stages of membrane traffic (7, 2). At 

the endoplasmic reticulum (ER), the Sec24 

subunit of the major COPII-cargo adaptor 
complex, Sec24-Sec23, binds to the cytosolic do- 
main of ER transmembrane proteins and pack- 
ages them into vesicles that traffic to the Golgi 
(1). Yeast also have two Sec24: paralogs, Iss1/ 
Sfb2 (56% identity) and Lst1/Sfb3 (23% identity), 
which form secretory cargo adaptor complexes 
with Sec23. Mammals have four SEC24 isoforms. 
The mammalian SEC24 homologs (SEC24A and 
SEC24B) are 50% identical but only share 20% 
identity with the Lstl homologs (SEC24C and 
SEC24D), which are also 50% identical to each 
other (3). 

ER homeostasis is maintained by the unfolded 
protein response (UPR), which is up-regulated 
by ER stressors, such as the accumulation of 
unfolded or misfolded proteins in the ER. The 
UPR restores homeostasis by a variety of mech- 
anisms, including the activation of signaling 
pathways that increase the production of the 
chaperones and enzymes that promote and 
regulate protein folding (4). To prevent improp- 
erly folded proteins from entering the secretory 
pathway, ER-associated degradation (ERAD) 
recognizes terminally misfolded proteins and 
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retrotranslocates them into the cytosol for deg- 
radation by the proteasome (5). ERAD, how- 
ever, is unable to clear all aberrant proteins from 
the ER (6). Macroautophagy (herein called auto- 
phagy) of the ER, or ER-phagy, could be part 
of a global ER stress response that restores 
cellular homeostasis when ERAD and/or the 
UPR fail to respond. There are two major types 
of autophagy pathways, bulk and selective (7). 
During starvation, bulk autophagy uses auto- 
phagosomes to scavenge cytoplasmic components 
for nutrients. To rapidly increase autophagosome 
number, COPII vesicles are rerouted to serve as a 
membrane source for bulk autophagy (7, 8). 
Selective pathways instead use autophagy re- 
ceptors to package cargo into autophagosomes. 
These cargoes include damaged or superfluous 
organelles, protein aggregates, and pathogens (9). 
Although ER-phagy was initially described in 
2005 (10), it was not until the first ER-phagy 
receptors were identified that the process was 
thought to be selective (11, 12). ER-phagy occurs 
at discrete foci on the ER (13); however, the auto- 
phagy receptors that load ER into autophago- 
somes are dispersed throughout the contiguous 
network of the ER. How sites on the ER are 
targeted for ER-phagy is unclear. We reasoned 
that the cytosolic machinery that recognizes and 
binds to ER-phagy receptors may play a role in 
marking specific sites on the ER where auto- 
phagy will occur. Because COPII coat subunits 
are known to participate in membrane-budding 
events at the ER (J), we investigated whether 
coat subunits play a role in sequestering ER do- 
mains into autophagosomes during ER-phagy. 


Lstl is required for ER-phagy but not 
bulk autophagy 


ER-phagy is typically induced in yeast by treating 
cells for 12 to 24 hours with rapamycin, a TOR 


(target of rapamycin) kinase inhibitor that mim- 
ics nutrient starvation (10, 77). Rapamycin also 
up-regulates the expression of the ER-phagy 
receptor Atg40, which increases ER-phagy activ- 
ity (11). Atg40, a reticulon-like ER membrane 
protein, is required for cortical and cytoplasmic 
ER degradation and also contributes to nuclear 
ER degradation. Nuclear ER turnover is predom- 
inantly facilitated by Atg39, a nucleophagy re- 
ceptor (11). 

We initiated our studies by investigating 
whether COPII coat subunits play a role in 
ER-phagy. The long incubation period with 
rapamycin to induce ER-phagy precluded us 
from analyzing temperature-sensitive mutants 
harboring COPII mutations because they die 
quickly at their restrictive temperature. The 
Ist1A, iss1A, and sec24-3A mutants, however, do 
not appreciably impair growth or secretion 
(14, 15). In sec24-3A cells, bulk autophagy is 
defective as a consequence of a failure to 
phosphorylate Sec24 at T324, T325, and T328 
(15). Sec24 phosphorylation at these sites, 
which are conserved in Issl1, enables COPII- 
coated vesicles to bind the core autophagy 
machinery (15). We found a defect in the 
degradation of the ER protein Sec61 in Jst7A, 
but not in the sec24-3A issIA and issIA mutants 
(Fig. 1, A to D, and fig. S1, A to C). One of the 
kinases that phosphorylates Sec24 during bulk 
autophagy, Hrr25 (15), was also dispensable for 
ER-phagy (fig. S1, D and E). Furthermore, 
Sec24 phosphorylation was not needed for 
pexophagy (fig. S2, A and B) and mitophagy 
(fig. S2, D and E). The degradation of Rtn], 
which marks ER tubules, the edges of sheets, 
and the nuclear ER in a small fraction of cells 
(16), was also defective in the Jst7/A mutant 
(Fig. 1, E and F, and fig. $3, A and B). Sim- 
ilarly, a third ER protein, Per33, was de- 
graded less efficiently in JstIA cells (fig. $3, C 
to F). Like Sec61, Per33 resides on the nuclear, 
cortical, and cytoplasmic ER (17). In addition to 
being induced by rapamycin, Lstl-mediated ER 
degradation was also induced by growth to the 
stationary phase (fig. S4, A to C). 

To determine whether Lstl1 functions in bulk 
autophagy, we measured the activation of vacu- 
olar alkaline phosphatase (Pho8A60) 4 hours 
after starvation (fig. S4D), as well as the cleavage 
of green fluorescent protein (GFP)-autophagy- 
related protein 8 (Atg8) in the vacuole (fig. S4, 
E and F) 24 hours after rapamycin treatment 
(18). When bulk autophagy is induced, GFP-Atg8 
and Pho8A60 are delivered to the vacuole, where 
Pho8A60 is activated by proteolytic cleavage. No 
defect in bulk autophagy was detected in JstIA 
cells by either assay. 

Additionally, Lst1 was not needed for pexo- 
phagy (fig. S2, B and C), mitophagy (fig. S2, E 
and F), or the biosynthetic cytosol-to-vacuole 
targeting pathway (fig. S2G), which also uses 
autophagy machinery (18). Thus, Lst1 and its 
paralog, Sec24, appeared to function in different 
autophagy pathways. Although phosphorylation 
of the Sec24: membrane distal surface is required 
for bulk autophagy (15), it was dispensable for 
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Fig. 1. Lstl is required for ER-phagy. (A) Translocation of Sec61-GFP to the vacuole was examined 
by fluorescence microscopy in WT and mutant cells 24 hours after rapamycin treatment. FM4-64 
was used to stain the vacuolar membrane. (B) The percentage of cells with Sec61-GFP delivered 
to the vacuole was quantitated from 300 cells. (C) Cleavage of Sec61-GFP to GFP in WT and mutant 
strains was analyzed by immunoblotting using anti-GFP antibody. A representative blot is shown. 
(D) Three separate experiments were used to quantitate the ratio of free GFP to Sec61-GFP from 
the data in (C). WT was set to 100%. (E) The translocation of Rtn1-GFP to the vacuole in WT 

and mutant cells was detected by fluorescence microscopy 24 hours after rapamycin treatment. 
(F) The percentage of cells with Rtn1-GFP delivered to the vacuole was quantitated from 300 cells. 
Scale bars in (A) and (E), 5 um. Error bars in (B), (D), and (F) represent SEM; n = 3 independent 
experiments. *P < 0.05, **P < 0.01, ***P < 0.001, Student's unpaired t test. 


ER-phagy. By contrast, Lst1 acted exclusively 
in ER-phagy. 

Lstl is primarily known for the formation of 
larger COPII-coated vesicles and the trafficking 
of glycosylphosphatidylinositol (GPI)-anchored 
proteins (7). Unlike Sec24, Lstl cannot package 
the ER-Golgi fusion machinery (SNAREs) into 
COPII vesicles. Consequently, vesicles contain- 
ing Lstl, but not Sec24, are unable to fuse with 
the Golgi (19). When we measured ER-phagy in 
two /st1 mutants (LSTI-B1 and LSTI-B2) that dis- 
rupt the packaging of GPI-anchored proteins 
into COPII vesicles (20), no ER-phagy defect was 
observed (fig. S5, A to D). Thus, the role of Lst1 
on the secretory pathway can be uncoupled from 
its contribution to ER-phagy. These results estab- 
lish an unanticipated role for this conserved 
coat protein in autophagy. 


Lstl acts in concert with Atg40 


Sec61-GFP-positive ER cisternae and foci accu- 
mulated in rapamycin-treated atg40A and IstI1A 
mutants (fig. S6é, A and B). Furthermore, the ER 
degradation defect in the JstJAatg40A double 
mutant was not significantly different from 
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either single knockout (fig. S6, C to F). These 
findings implied that Lstl and Atg40 function 
in the same pathway. Because increased levels 
of autophagy receptors increase ER-phagy and 
the number of ER-containing autophagosomes 
(11, 12), ER-phagy receptors are thought to play 
a role in packaging ER into autophagosomes 
(12). To address if Lst1 works with Atg40 to per- 
form this function, we induced Atg40 expression 
with rapamycin (12) and asked if this promotes 
an association with Lstl. To assess the coloca- 
lization of Lst1-3xGFP with Atg40-2xmCherry, 
we focused on noncortical Atg40 puncta be- 
cause previous studies indicated that these more 
central puncta have greater accessibility to the 
autophagy machinery (17). Sec24-GFP and Sec13- 
GFP, a component of the COPII coat scaffold 
complex (Sec13-Sec31) (1), were examined as con- 
trols. Although all COPII coat subunits mark ER 
exit sites (ERES) in nutrient-rich growth con- 
ditions (21), only Lst1-3xGFP and its partner, 
Sec23-GFP, showed increased colocalization with 
Atg40-2xmCherry puncta (Fig. 2, A to C, and fig. 
87, A to C). Unlike Atg40, Lstl and Sec23 ex- 
pression did not increase with rapamycin treat- 
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ment (fig. S7D). The number of Lst1-3xGFP and 
Sec23-GFP puncta was also unchanged (fig. S7E). 
Although previous studies showed that CUPI- 
driven Atg40 overexpression only increased 
ER-phagy in the presence of rapamycin (11), over- 
expression was sufficient to drive Atg40 coloc- 
alization with Lstl (fig. S8, A and B). Rapamycin 
also induced the expression of the nucleophagy 
receptor Atg39 (17) but did not induce the co- 
localization of Lst1 with Atg39 (fig. S8, C and D). 
Furthermore, Lst1 did not contribute to the deg- 
radation of the nuclear ER marker Hmeg] (fig. 
S8, E to H). 

Selective autophagy receptors bind to the 
ubiquitin-like protein Atg8 at sites of auto- 
phagosome formation (9). Consistent with the 
proposal that Lst1 functions with Atg40 to tar- 
get ER domains for degradation, rapamycin 
induced Atg8-Lst1 colocalization in wild-type 
(WT) but not atg40A cells (Fig. 2D and fig. SQA). 
By contrast, Atg8 showed increased colocaliza- 
tion with two COPII coat subunits required for 
bulk autophagy, Sec24 and Secl3 (22), in both 
WT and atg40A cells (Fig. 2D and fig. S9A). 
Because Sec23 is a binding partner for both 
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Fig. 2. Lstl and Sec23 colocalize with Atg40 and Atg8 in rapamycin- 
treated cells. (A) Representative images of cells treated for 

O or 6 hours with rapamycin. Arrowheads indicate Lst1-3xGFP that 
colocalized with Atg40-2xmCherry puncta. (B) Bar graph showing 

the percentage of Lst1-3xGFP or Sec24-GFP colocalizing with 
Atg40-2xmCherry puncta. (C) Bar graph showing the percentage 

of Sec23-GFP or Sec13-GFP colocalizing with Atg40-2xmCherry puncta. 
(D) Cells were treated for O or 3 hours with rapamycin, and the 


Lstl and Sec24, the colocalization of Atg8 with 
Sec23 was only partially dependent on Atg40 
(Fig. 2D and fig. SQA). 

Next, we investigated whether ER organiza- 
tion is important for the colocalization of Lst1 
with Atg40. Lnp] is an ER protein that resides 
at and stabilizes the three-way junctions that 
form when two tubules fuse to each other (23). In 
its absence, ER network rearrangements are dis- 
rupted and Atg40 puncta fail to access the auto- 
phagy machinery (17). As a consequence, ER is 
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not packaged into autophagosomes in /npIA cells 
(17). We found that Lst1-3xGFP failed to colocalize 
with Atg40-2xmCherry (Fig. 2E and fig. SOB; 
also see Pearson’s coefficient in fig. S9C) in InpIA 
cells. Furthermore, even though Lst1-3xGFP and 
RFP-Atg8 puncta appeared unaltered in the npIA 
mutant, Atg8 also failed to colocalize with Lst1 
(Fig. 2F and fig. S9D). We propose that Atg8- 
Lstl-colocalizing puncta (usually one per cell) 
represent active ER-phagy sites stabilized by 
Lnpl. Although Atg40 puncta tend to accumu- 
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RFP-Atg8 that colocalized with GFP-tagged COPII subunits was 
quantitated. (E) Left, arrowheads indicating Lst1-3xGFP that colocalized 
with Atg40-2xmCherry puncta 6 hours after rapamycin treatment. 
Quantitation is on the right. (F) Quantitation of RFP-Atg8 that 
colocalized with Lst1-3xGFP in WT and /np1A cells. Scale bars in (A) 
and (E), 5 um. Error bars in (B) to (F) represent SEM; n = 3 to 6 
independent experiments. NS, not significant (P = 0.05); *P < 0.05, 
**P < 0.01, ***P < 0.001, Student's unpaired t test. 


late at the cell periphery in Lnp1-depleted cells 
(17), the distribution of Atg40 puncta was un- 
perturbed in JstIA cells (fig. S9E). 

The localization studies described above sug- 
gested that Lstl and Atg40 interact with each 
other during ER-phagy. Lst1 could interact with 
Atg40 via its cytosolic domain (77); however, this 
possibility could not be tested because the cyto- 
plasmic domain was unstable. We were also un- 
able to express full-length Atg40 in bacteria. 
Instead, to address whether Lstl interacts with 
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Fig. 3. Lstl binds to Atg40 and facilitates the packaging of ER 
into autophagosomes. (A) Equimolar amounts (0.1 uM) of purified 
recombinant GST and GST fusion proteins were incubated with 

2 mg of yeast lysate prepared from Atg40-3xFLAG-untagged or 
-tagged cells treated with rapamycin. (B) Representative electron 
micrographs of WT and pep4A strains treated for 12 hours with 
rapamycin. The boxed areas in the left panels are magnified 

on the right. Black asterisk, an autophagic body containing an ER 


fragment; white asterisk, an autophagic body lacking an ER fragment; 
arrowhead, an ER fragment inside an autophagic body. Scale bars 

in the left and right panels represent 1 um and 500 nm, respectively. 
(C) Bar graph showing the average number of ER-containing 
autophagosomes. (D) Bar graph showing the average number 

of total autophagosomes. Error bars in (C) and (D) represent SEM; 
n= 100 cells. *P < 0.05, ***P < 0.001, Student's unpaired t test. 

N, nucleus; V, vacuole. 


Atg40, we performed binding studies with 
lysates prepared from untreated and rapamycin- 
treated cells expressing Atg40-3xFLAG. Atg40- 
3xFLAG from both lysates bound to purified, 
bacterially produced glutathione S-transferase 
(GST)-Lst1, but not to GST-Secl3, GST-Sec24 
(Fig. 3A and fig. S10, A and C), or GST-Sec23 
(fig. SIOB). GST-Lst1 also failed to bind to two 
other ER proteins, Ufel and Yip! (fig. S10D). 
These findings indicate that Lst1, but not Sec24 
or other COPII coat subunits, binds to Atg40. 
Secl13-Sec31 could have failed to associate with 
Atg40 because the binding of Lst1-Sec23 to 
Atg40 sterically hindered its interaction. Sup- 
porting the proposal that the interaction of 
Atg40 with Lstl1 is needed for ER-phagy, the 
rapamycin-induced degradation of Lst1-GFP 
required Atg40 and Atg40-GFP degradation re- 
quired Lst1 (fig. S11, A to H). 

To determine the number of ER-containing 
autophagosomes in the /st7A mutant, we exam- 
ined cells depleted of the vacuolar protease Pep4: 
by conventional electron microscopy (Fig. 3, B 
to D). Although autophagic bodies were absent 
in the atg/4A mutant, which blocks autopha- 
gosome formation (Fig. 3, B to D), the number 
of autophagosomes formed in the Jst7A mu- 
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tant was the same as that in WT (Fig. 3D). By 
contrast, a significant reduction in autophagic 
bodies that contained ER was observed in 
the /st1A mutant (Fig. 3, B and C). Consistent 
with the proposal that Lst1 works with Atg40 
to package cortical and cytoplasmic ER into 
autophagosomes, the /stJA defect was compara- 
ble to that seen in atg40A cells (Fig. 3, B and C). 


ER stress up-regulates Atg40 


In mammalian cells, the putative ER-phagy 
receptor FAM134B targets ERAD-insensitive sub- 
strates to autophagy (6). How the ER commu- 
nicates with the autophagy machinery when 
ERAD does not respond or is overwhelmed is 
unknown. 

The Z variant of human alpha-1 antitrypsin 
(ATZ) is an aggregation-prone ERAD sub- 
strate that is also targeted to the lysosome 
via non-autophagosome-mediated (24) and 
autophagosome-mediated pathways (25, 26). 
When ATZ is highly overexpressed in yeast 
under the inducible galactose promoter (GALI), 
excess ATZ not degraded by ERAD is directed 
to the vacuole through two pathways, vacuolar 
protein sorting and autophagy (27). Blocking 
the latter pathway leads to the accumulation of 


ATZ aggregates in the ER (27). Currently, it is 
unknown whether this autophagosome-mediated 
pathway is ER-phagy. To determine whether 
ATZ aggregates in the absence of Atg40 or 
Lst1, microsomal membrane fractions prepared 
from WT, atgI4A, atg40A, and IstIA cells were 
analyzed on sucrose gradients. Soluble ATZ was 
primarily found at the top of the WT gradient, 
whereas ATZ from mutant lysates was largely in 
the pellet (Fig. 4, A to C, and fig. S12, A and B). 
Aggregation appeared to result from a block in 
ER-phagy, because substantially less ATZ was ob- 
served in the pellet when lysates from the Jst/-B1 
mutant were examined (fig. S12C). The JstJ-B1 
mutant disrupts membrane traffic (20) but not 
ER-phagy (fig. S5). Aggregation was also specific 
to misfolded ATZ, because appreciably less WT 
M-variant alpha-1 antitrypsin (ATM) was ob- 
served in the pellet when lysates from Jst/A or 
atg40A cells were examined (Fig. 4D). Consistent 
with this observation, a portion of the ATM, but 
not ATZ, was secreted (fig. S12D). Thus, only 
mutant alpha-1 antitrypsin aggregated in the ER 
in the absence of ER-phagy. It is notable that in 
the pancreas, the loss of the ER-phagy receptor 
CCPGI1 leads to the accumulation of aggregated 
proteins in the ER (73). 
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Fig. 4. ATZ aggregates accumulate in atg40A and Ist1A mutants. 
Microsomal membrane fractions prepared from WT and mutant 
cells harboring plasmids expressing either ATZ (A to D) or 

ATM (D) were lysed in the presence of detergent and fractionated 


Previous studies showed that multiple COPII 
coat subunits, including Sec24 and Lstl, are tar- 
gets of the UPR (28). To determine whether 
Atg40 is also a UPR target, we overexpressed 
ATZ in the presence or the absence of Irel, an 
ER transmembrane kinase that triggers the UPR 
(29). ATZ induction enhanced Atg40 expression 
in WT cells (compare Fig. 5, A and B, with fig. 
S12E). Furthermore, Atg40 expression and ATZ 
accumulation were further enhanced in trelA 
mutant cells, which block the UPR and augment 
ER stress (4, 29) (Fig. 5, A and B). By contrast, 
overexpressed ATM did not increase Atg40 ex- 
pression in the zreJA mutant (fig. S12F). To de- 
termine whether the loss of Atg40 induces the 
UPR, we examined pathway induction by flow 
cytometry in cells carrying an integrated UPR- 
regulated GFP reporter. Whereas the atg39A 
(Fig. 5C) and sec24-3A issIA mutants behaved 
like WT (fig. S12G), the level of UPR induction in 
the atg40A and JstJA mutants was the same as 
that in an ERAD-deficient hrdJA strain (Fig. 5C) 

(5). Thus, although Atg40 was not found to bea 
UPR target, the UPR responded to the loss of 
ER-phagy. 

Because Atg40 expression is not under the 
control of the UPR, we next sought to identify 
the pathway that regulates it. ER stress is known 
to modulate the expression of core autophagy 
machinery (30). Therefore, we considered the 
possibility that autophagy transcriptional reg- 
ulators control Atg40 expression. Pho23, which 
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associates with the histone deacetylase large 
complex (Rpd3L), is a transcriptional repressor 
of autophagy that modulates autophagosome 
abundance via Atg9 (37). We found that Atg40, 
but not the UPR targets Kar2 and Lstl, were 
up-regulated in the 7pd3A and pho23A strains 
(Fig. 5, D and E). Atg40 and Atg9 mRNA levels 
also increased in these mutants (Fig. 5F), indi- 
cating that the Atg40 ER-phagy receptor is a 
target of the same transcriptional regulators 
that modulate Atg9 and other core autophagy 
machinery. 


Mammalian SEC24C is required 
for ER-phagy 


It was recently reported that SEC24A and 

SEC24B, but not SEC24C and SEC24D, contrib- 

ute to bulk autophagy (32). To determine wheth- 

er any of the SEC24 isoforms are required for 

ER-phagy in mammalian cells, we knocked down 

each of the four isoforms by small interfering 
RNA (siRNA) in U20S cells (fig. S13A). We then 
monitored the delivery of two ER-phagy recep- 
tors, FAM134B and RTN3, to lysosomes in the 
Torin2 (TOR inhibitor2)-treated SEC24-depleted 
cells. We chose to examine these two mamma- 
lian ER-phagy receptors because Atg40 has a do- 
main structure that is similar to FAM134B yet 
localizes to the tubular ER-like RTN3 (22, 33). 
3xFLAG-FAM134B was resistant to degradation 
in SEC24:C-depleted cells, but sensitive in cells 
depleted of SEC24A, SEC24B, or SEC24D (Fig. 6A 


Fraction Number 


on a sucrose gradient. Gradient fractions were blotted with antibody 
directed against alpha-1 antitrypsin. Soluble alpha-1 antitrypsin 
was at the top of the gradient and aggregates remained in the 


and fig. S13B). Moreover, the delivery of 3xFLAG- 
FAM134B to lysosomes (LAMP1) required SEC24C 
and ULK1, a component of the autophagosome 
biogenesis machinery (Fig. 6, B and C, and fig. 
$13C). The delivery of RTN3 to lysosomes was 
also SEC24C and ULK1 dependent (Fig. 6, D and 
E, and fig. S14). Thus, in mammals, as in yeast, 
TOR inhibition stimulated the delivery of 
ER to lysosomes via Lst1/SEC24C-mediated 
ER-phagy. 


Discussion 


Here, we report a noncanonical role for the 

COPII-cargo adaptor complex Lstl-Sec23 and 

show that it functions with the ER-phagy recep- 

tor Atg40 to specify ER domains for ER-phagy. 

By analogy to its role as a cargo adaptor in vesicle 

traffic (20), Lstl-Sec23 may sort ER domains 

into autophagosomes through an interaction 
with the cytoplasmic domain of Atg40 (fig. 
$15). Several lines of evidence highlight the 
key events mediated by Lstl on this pathway. 
First, when Atg40 expression was up-regulated 
by the TOR inhibitor rapamycin, the Lst1-Sec23 
complex associated with Atg40 puncta. Second, 
in accord with the proposal that the colocaliza- 
tion of Lst1 with Atg40 is needed for ER-phagy, 
Lstl1 failed to colocalize with Atg40 and Atg8 
in the /npIA mutant. Lnp1 is needed for ER 
organization and the incorporation of ER into 
autophagosomes (17). Third, consistent with a 
role in ER packaging, Lst1 was required for the 
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degradation of Atg40, and the number of ER- 
containing autophagosomes was reduced in the 
Ist1A mutant. 

To date, six ER-phagy receptors have been 
identified in mammalian cells (12, 13, 33-37). 
Although some receptors respond to starvation, 
others only respond to ER stress, and some re- 
spond to both. Here, we have shown that Atg40 
responds to starvation and ER stress. Atg40 
was up-regulated by both rapamycin (17) and 
the overexpression of an aggregation-prone 
misfolded secretory protein (ATZ). In the lat- 
ter case, the induction of Lstl-mediated ER- 
phagy reduced the level of aggregated ATZ in 
the ER (fig. S15). Consistent with the observa- 
tion that a TOR inhibitor up-regulates Atg40 
(11), we found that Atg40 expression is mod- 
ulated by autophagy transcriptional regula- 
tors. Recent studies have shown that ER stress 
inhibits TOR (38). Thus, TOR-dependent auto- 
phagy transcriptional regulators may also mod- 
ulate the up-regulation of ER-phagy receptors 
to reduce ER protein aggregation. 

Our findings imply that a specific subcomplex 
of the COPII coat that contains Lstl-Sec23 binds 
to Atg40 to package ER domains into autopha- 


gosomes during ER-phagy (fig. $15). ER-phagy 
occurs at distinct sites on the ER that we have 
named ERPHS (ER-phagy sites). The absence 
of Sec24 from these sites distinguishes them 
from the Sec24-containing ERES that bud con- 
ventional COPII vesicles (fig. S15). Because pre- 
vious studies in yeast did not reveal a substantial 
accumulation of cytosolic ER fragments during 
ER-phagy, it was suggested that ER domains are 
engulfed into autophagosomes on or in close 
proximity to the ER network (10) (fig. S15). We 
found that Lst1-GFP was degraded in the vacuole 
in an Atg40-dependent fashion (fig. S15), indicat- 
ing that ER fragments were packaged into auto- 
phagosomes before they completely uncoated. 
Because the Lst1-Sec23 complex cannot package 
SNAREs, fragments that bud from the ER can- 
not fuse with the Golgi before their incorpora- 
tion into autophagosomes. 

The formation of ERPHS in yeast was de- 
pendent on the ER protein Lnp1, which is needed 
for stable three-way ER junctions (fig. S15) (23). 
Consistent with this observation, the mamma- 
lian ER-phagy receptor TEX264 concentrates 
and colocalizes with LC3 (the Atg8 homolog in 


mammals) at ER junctions (36, 37). The locali- 


zation of LNP1 to ER junctions depends on 
ATLASTIN, which is also required for ER-phagy 
(16, 39, 40). Furthermore, ATLASTIN 3 (ATL3) 
has recently been reported to be an ER-phagy 
receptor (35). Mutations in ATLASTIN and LNP1 
are associated with hereditary spastic para- 
plegia (HSP) and HSP-like neuropathies (41, 42). 
The ER-phagy receptor FAM134B, which is 
needed for both ER-phagy and autophagosome- 
independent ER-to-lysosome degradation (72, 24), 
is linked to sensory neuron function (12). 

The mammalian Lst1 homolog SEC24C was 
also required for the delivery of ER sheets 
(FAM134B) and tubules (RTN3) to lysosomes 
for degradation. This observation is in accord 
with recent data showing that the interactome 
for the ER-phagy receptor RTN3 includes SEC24:C 
but not the other SEC24 isoforms (33). Although 
professional secretory cells are insensitive to 
SEC24C depletion, the loss of SEC24C in post- 
mitotic neurons leads to microcephaly and peri- 
natal death (43). Whereas SEC24C and SEC24D 
are ~50% identical, their functions only partially 
overlap in the brain (43), and we did not detect 
a substantial role for SEC24D in ER-phagy. Be- 
cause our experiments and the RTN3 interactome 
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Fig. 5. Atg40 expression is regulated by the Rpd3-Pho23 complex. 

(A) Atg40 expression was examined 24 hours after growth in 2% galactose. 
Western blot analysis was performed on lysates to detect ATZ (top), Atg40- 
3xFLAG (middle), and Adh1 (bottom). (B) The data in (A) were normalized to 
Adh1 and WT (—ATZ) was set to 100%. (C) UPR induction was assayed by flow 
cytometry in the absence or presence of 8 mM dithiothreitol (DTT). WT (—DTT) 
was set to 1.0. (D) Atg40-3xFLAG, Kar2, and Lstl levels were measured by 
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Western blot analysis in the indicated strains. (E) The data in (D) were 
normalized to Adh1 and WT was set to 100%. (F) Quantitative reverse 
transcription—polymerase chain reaction was used to assess MRNA levels. The 
data were normalized to actin MRNA levels and WT was set to 1.0. Error bars in 
(B), (C), and (F) represent SEM; n = 3 independent experiments. Error bars 
in (E) represent SEM; n = 4 to 8 independent experiments. NS, not significant 
(P = 0.05); *P < 0.05, **P < 0.01, ***P < 0.001, Student's unpaired t test. 
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Student's unpaired t test. 


analysis both used U2OS cells, it remains for- 
mally possible that other cell types may require 
SEC24D for ER-phagy. These findings suggest 
that ER-phagy plays a crucial role in maintain- 
ing neuronal homeostasis. 

In summary, we propose that in yeast and 
mammals, COPII cargo adaptors package differ- 
ent cargoes (secretory proteins or ER domains) 
into membranes and target them to distinct 
pathways. Our findings imply that ERES, which 
bud canonical COPII vesicles that traffic on the 
secretory pathway, are distinct from the ERPHS 
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used in autophagy. The secretory and ER-phagy 
pathways also appear to be regulated by different 
stress-response pathways; secretory protein ho- 
meostasis is regulated by the UPR, whereas ER- 
phagy depends on TOR-dependent autophagy 
transcriptional regulators. 
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PROTEIN DYNAMICS 


Proton uptake mechanism in 
bacteriorhodopsin captured by serial 
synchrotron crystallography 


Tobias Weinert", Petr Skopintsev'|, Daniel James'|, Florian Dworkowski”, 
Ezequiel Panepucci”, Demet Kekilli’, Antonia Furrer’, Steffen Briinle’, Sandra Mous”, 
Dmitry Ozerov*, Przemyslaw Nogly’, Meitian Wang”, Jérg Standfuss* 


Conformational dynamics are essential for proteins to function. We adapted time-resolved 
serial crystallography developed at x-ray lasers to visualize protein motions using synchrotrons. 
We recorded the structural changes in the light-driven proton-pump bacteriorhodopsin over 
200 milliseconds in time. The snapshot from the first 5 milliseconds after photoactivation 
shows structural changes associated with proton release at a quality comparable to that of 
previous x-ray laser experiments. From 10 to 15 milliseconds onwards, we observe large 
additional structural rearrangements up to 9 angstroms on the cytoplasmic side. Rotation of 
leucine-93 and phenylalanine-219 opens a hydrophobic barrier, leading to the formation of a 
water chain connecting the intracellular aspartic acid—96 with the retinal Schiff base. The 
formation of this proton wire recharges the membrane pump with a proton for the next cycle. 


roteins are intrinsically dynamic mole- 
cules that follow defined conformational 
rearrangements to drive the fundamental 
biochemistry of life. Structural studies of 
these dynamic processes are challenging, 
because structural intermediates are often short- 
lived. Pump-probe crystallography offers the 
opportunity to capture successive temporal snap- 
shots of light-activated processes to study protein 
dynamics within the crystallographic lattice at 
the level of individual atoms, at physiological tem- 
peratures and on wild-type proteins (7). 

X-ray free electron lasers (XFELs) and the 
emerging method of time-resolved serial femto- 
second crystallography (TR-SFX) have brought 
exciting opportunities for structural biologists 
(2, 3). Recent studies on photoactive yellow pro- 
tein (4, 5) show that TR-SFX is a powerful new 
tool for studying the structural dynamics of pro- 
teins within a crystalline environment. The ultra- 
fast but bright x-ray pulses from these powerful 
linear accelerators outrun most radiation dam- 
age effects (6) and allow the use of small crystals, 
resulting in high protein activation levels (4). The 
serial injection of crystals further favors the study 
of nonreversible reactions and the collection of 
redundant datasets of high quality. The great 
potential of the TR-SFX method for studying 
membrane transport can be realized when com- 
paring results (7) from freeze trapping studies of 
the light-driven proton pump bacteriorhodopsin 
(bR) to a whole series of structural states ranging 
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from the femtosecond (8) to the early millisecond 
regime (9). 

Primary energy conversion processes in biology 
are variations around one central theme: proton 
transfer across membranes. Time-resolved crys- 
tallographic studies provided much insight into 
how proton pumping is achieved in bR. In par- 
ticular, they focused on the early events in the 
pumping cycle, including the ultrafast process 
of energy capture through photoisomerization of 
retinal (8) and the proton release step from the 
retinal Schiff base (SB) toward the extracellular 
side of the membrane (9). The fundamental piece 
missing in our dynamic view of proton transport 
is how the protein rearranges to reprotonate the 
SB from the cytoplasmic side to reload substrate 
for the next pumping cycle. 

We answer this question by adapting injector- 
based serial crystallography for time-resolved 
measurements in the millisecond range at wide- 
ly available synchrotron sources. Requirements 
for time-resolved crystallography using the Laue 
method were never met for bR (7). Therefore, 
pioneering works to determine the structures 
of spectroscopic intermediates were carried out 
using freeze-trapping and mutagenesis (JO). Al- 
though mutagenesis and freeze trapping provide 
valuable insights and allow trapping of individ- 
ual structural intermediates, major controversies 
remained because mutations modify critical acti- 
vation switches and structures of mutants may 
not adequately reflect rearrangements in the 
native protein. Such structures may further suf- 
fer from radiation damage, are obtained at non- 
physiological temperatures, and lack the temporal 
dimension critical to understanding protein dy- 
namics. Serial millisecond crystallography (SMX) 
approaches work well with high-viscosity injec- 
tors (11), as the injectors extrude crystals slowly 
and allow sufficient x-ray exposure to collect 
high-resolution data with the photon flux avail- 


able at synchrotrons (12). Our recent work has 
shown that modern high-frame-rate and low- 
noise detectors make SMX a viable method for 
routine room-temperature structure determina- 
tion (13). By extending the setup with a simple 
class 3R laser diode, we enable time-resolved 
studies with millisecond time resolution (TR- 
SMX) and bring dynamic serial crystallography 
from the few operational XFELs to the large 
community of synchrotron users worldwide. We 
demonstrate the technology by recording the 
temporal evolution of two structurally distinct 
intermediate states of bR and their decay back 
to the dark state. The closed state obtained by 
TR-SMX at 0 to 5 ms shows the structural re- 
arrangements necessary for proton release that 
are associated with the spectroscopic M-state of 
bR with comparable quality and in agreement 
with previous XFEL experiments. The open state 
that we obtained from data collected 10 to 15 ms 
after activation is characterized by large con- 
formational rearrangements up to 9 A on the 
cytoplasmic side, which have been associated 
with the spectroscopic N-state of bR and have 
not yet been resolved by dynamic crystallog- 
raphy. These open the protein for the formation 
of a Grotthuss proton wire, a transient water 
chain that allows the reprotonation of the SB 
from Asp®°, the primary proton donor at the 
cytoplasmic side of the membrane. The obser- 
vations align with the classical alternate access 
model of membrane transport and further com- 
plete the dynamic view of vectorial proton trans- 
port in bR by resolving the critical proton uptake 
step at the end of the pumping cycle. 


Time-resolved serial 
millisecond crystallography 


The most straightforward experiment possible 
with our TR-SMX setup (Fig. 1) leaves the laser 
diode on while data from continuously light- 
activated protein molecules within the crystal 
are being collected. By comparing these data to 
serial data collected without illumination (fig. 
SI), it is possible to reveal dominant confor- 
mational changes in a photocycle analogous to 
steady-state spectroscopic techniques. From 
bR microcrystals with a size of about 35 um 
by 35 um by 3 um, we collected 114,052 light and 
119,374 dark diffraction patterns in the steady- 
state mode with data extending to a resolution of 
1.8 A (table S1). The quality of difference electron 
density maps [F, (light) — F,(dark)] improved with 
the number of included images with character- 
istic difference density peaks starting to plateau 
at about 10,000 images, indicating data conver- 
gence (fig. S2). The estimated activation level was 
38% (see materials and methods) and resulted in 
well-resolved difference density map features, 
with the strongest positive and negative peaks 
at 7.3 and 8.50, respectively. Structure factor 
extrapolation allows selective refinement of the 
activated species in a crystal by removing the 
contribution of nonactivated states from struc- 
ture factor amplitudes (14) (fig. S3 and materials 
and methods). Refinement against extrapolated 
steady-state data revealed two activated species 
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of bR (table S2). Although one subspecies had 
been observed in previous TR-SFX structures 
obtained at 1.7 and 8.3 ms after photoactivation, 
which we termed the “closed” state, the second 
subspecies featured large rearrangements on 
the cytoplasmatic side, indicating that the dy- 
namic view of proton pumping in bR had not yet 
been completed in the previous studies; hence, 
we termed this intermediate the “open” state 
(table $2). 

To follow the rise and decay of these inter- 
mediates and to obtain separate structures, we 
developed a data collection scheme, which trig- 
gers the laser diode using a transistor-transistor 
logic signal issued from a delay generator to 
achieve laser exposure synchronized with data 
collection at the detector (Fig. 1). In this setup, 


Laser on 


Injector 


bR crystals were exposed for 5 ms with laser light 
at the x-ray interaction region, while the photo- 
reaction was tracked by collecting 5-ms-long 
timepoints over 40 consecutive frames. This data 
collection sequence was continuously repeated 
while new crystals were resupplied by the high- 
viscosity injector (7). After about 5 hours of data 
collection, each of the timepoints contained 
~13,000 indexable diffraction patterns with the 
resolution extending to 2.3 A (table S1). 
Analysis of the data provides a “continuous” 
molecular movie of structural changes over 200 ms 
in time during which bR molecules complete their 
photocycle within the crystal. Within this time 
frame, we observe a long-term decay of difference 
map signal and complete restoration of the dark 
state within about 100 ms (movie S1). The rate of 


Laser Diode 


Laser on 


Occupancy in a.u. 


0-5 ms 


0-5 


this long-term decay (fig. S4) and the evolution 
of structural states as determined by occupancy 
refinements (Fig. 1) occur within the temporal 
range established by time-resolved spectroscopy 
on bR crystals (9, 15, 16). Successive structural 
intermediates typically overlap, which presents a 
general problem for time-resolved experiments. 
Following the rise and decay of intermediates 
in small steps, as demonstrated here, allows 
one to choose time delays at which occurring 
intermediates can best be separated. These re- 
sults demonstrate that time-resolved serial crys- 
tallography is not limited to XFEL sources but 
can also be done efficiently at synchrotrons. Re- 
sults are of a quality comparable to that of our 
previous time-resolved studies at XFELs on the 
same crystals (fig. S5), even though the resolution 


open-state 
closed-state 


5-10 10-15 15-20 20-25 25-30 
Time delay in ms 


Fig. 1. Experimental setup and evolution of structural intermediates 
over time. (A) Experimental pump-probe setup and TR-SMX data 
collection. The laser (green) originates from a class 3R laser diode and 
is controlled by way of two steering mirrors before being focused to 
intersect with the extrusion path of the high-viscosity injector (gray) and 
the x-rays (red). The laser and detector are triggered by a delay generator. 
Running the detector at 200 Hz, the data collection cycle consists of 
AO frames each 5-ms long. The laser is activated for 5 ms during 
collection of the first data frame and is then turned off for the remaining 
frames, during which the illuminated bR completes its photocycle. 

(B) Large-scale conformational changes on the cytoplasmic ends of 
helices E, F, and G at different timepoints [dark state in purple, closed 
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state (O to 5 ms) in blue, and open state (10 to 15 ms) in red] are evident 
in difference electron density maps [left: F.(O to 5 ms) — F,(dark); 

right: F,(10 to 15 ms) — F,(dark) at 30 with positive difference density 

in blue and negative difference density in yellow]. (C) Displacement 
occurs on the cytoplasmic side at the end of helix F. (D) Combining 

the dark-, closed- and open-state models included as alternative 
conformations into a single model and refining their occupancies against 
collected time-resolved data provide a measure of the structural 
evolution of the two intermediates over time. Biexponential fits against 
the normalized data are shown with solid lines. The fit resulted in 

decay times of 23.8 + 2.7 ms for the open conformation and 4.7 + 0.3 ms 
for the closed conformation. 
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was limited by the available photon flux. How- 
ever, the resolution is compensated by high 
levels of activation from the millisecond-long 
light exposure, more accurate data from the use 
of a photon counting detector, and a more stable 
monochromatic beam, as previously observed 
during de novo phasing (13). Owing to the slow 
extrusion rate, less than 20 ul of prepared micro- 
crystals were necessary to obtain these data, 
instead of milliliters needed for the XFEL exper- 
iments. The reduced sample consumption sub- 
stantially lowers the entry barrier for time-resolved 
crystallographic studies. Although the simple 
TR-SMX setup implemented at the Swiss Light 
Source delivers excellent results, there is great 
potential to improve the method further. Pink 
beams (17) and diffraction-limited sources (18) 
optimize beam characteristics for faster data 
acquisition. The Hadamard transform method 
(19) and continued software development (20) 
are means to improve data analysis. Together 
with stronger nanosecond lasers, as well as new 
fast and accurate detectors (27), these improve- 
ments will allow the time resolution to be pushed 
from milliseconds into the lower microsecond 
regime to cover the majority of biologically rele- 
vant protein dynamics. 


Completing the bR pumping cycle 

The molecular movies of structural changes in 
bR are an impressive example of how XFEL 
sources can be used to understand biology (7). 
Our TR-SMX measurements extend the dynam- 
ic view of bR into the second half of the pho- 
tocycle (Fig. 2), where the proton substrate is 
reloaded. 

Refining the closed-state structure obtained 
by steady-state SMX against extrapolated data 
from the 0- to 5-ms timepoint (table S2), we 
obtained a good fit to M-state structures ob- 
tained by cryotrapping (22) or TR-SFX (8, 9) 
[root mean square deviation (RMSD) of 0.46 A 
for the cryo-trapped structure 1IW9, 0.40 A for 
the 1.7-ms structure 5B6Z, and 0.27 A for the 
8.3-ms structure 6G7L]. Difference maps ob- 
tained from 10 to 15 ms onward, however, showed 
clear additional features at the cytoplasmic side 
of the protein (Fig. 1). When refining the open- 
state structure obtained by steady-state SMX 
against extrapolated 10- to 15-ms data, we ob- 
tained a better fit in the dark-state structures of 
the constitutively open Asp*°—>Gly/Phe’™—Cys/ 
Phe?’—Leu triple mutant [RMSD of 0.59 A for 
the x-ray structure 4FPD (23) and 0.85 A for the 
electron diffraction structure 1FBK (24)], which 
are considered analogous to the wild-type N-state 
obtained by photoactivation (fig. S6). The struc- 
tural rearrangements in the open state (10 to 
15 ms) with respect to the dark and closed states 
(0 to 5 ms) occur in the cytoplasmic ends of 
helices E, F, and G. The largest change in helix F 
corresponds to a movement of 9 A measured at 
the Ca position of Glu’ (Fig. 1 and movie S2), 
which resembles the movement of helix 6 upon 
activation of structurally related G protein-coupled 
receptors (fig. S7). The magnitude of the motion is 
consistent with transient movements of helices F 
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Fig. 2. Proton pumping in bacteriorhodopsin proceeds through three principal steps. (A) The 
proton pumping cycle in bR with spectroscopic intermediates (I, J, K, L, M, N, O) and the relevant time 
domains for the individual steps. (B) Overview of the three principal steps in the proton pumping 
cycle of bR. (C) At 10 to 15 ms, we observe the opening of the hydrophobic barrier between the primary 
proton donor Asp®° to the SB [dark state in purple, open state (10 to 15 ms) model in red]. The 
changes are accompanied by the ordering of three water molecules [positive F,(10 to 15 ms) — 
F,(dark) density in blue at 30]. (D) The O- to 5-ms timepoint shows the rearrangements necessary 
for proton release, in agreement with previous XFEL studies (8, 9) [dark state in purple, closed state 
(O to 5 ms) model in blue]. Arrows indicate the direction of proton transfer reactions. 


and G in the millisecond range observed by time- 
resolved spin labeling studies in purple mem- 
branes (25), suggesting that bR can complete 
the motion inside the crystal lattice (fig. S8). The 
final O-state does not accumulate well in bR crys- 
tals (16), possibly because crystal contacts, which 
change upon formation of the open state (fig. S8), 
affect the kinetics of the N-state to O-state tran- 
sition inside the crystal. However, the O-state is 
in a fast equilibrium with the N-state (26), sug- 
gesting that no large structural rearrangements 
occur in the transition. The small changes upon 
relaxation of retinal from the cis-isomer back to 
the trans-isomer are spread out in time, with 
associated difference map features disappearing 
around the 35- to 40-ms timepoint (movie S1). 
Our TR-SMX experiment thus completes the 
crystallographic view of the bR photocycle, by 
revealing the structural rearrangements over the 
timeframe in which the M-state to N-state tran- 
sition occurs inside crystals until the dark state 
is recovered. 


Structural changes in helices E and F involve 
large side-chain motions. The isomerizing retinal 
pushes against Trp"? in helix F starting at earlier 
timepoints (9). This motion continues in the tran- 
sition from the closed state to the open state, 
driving the displacement of helix F and allowing 
the aromatic ring of Phe’”® to move 9 A and re- 
place the ring of Phe’. This alters the hydro- 
phobic interaction network holding helix F in its 
original position. Moving helix F occurs concur- 
rently with the disordering of the C terminus 
starting from Arg””’ at the end of helix G (Fig. 3). 
By contrast, the structural changes in helix C 
are dominated by smaller side-chain rearrange- 
ments. The largest conformational change in 
helix C is a rotamer change of Leu? matching 
a displacement of the nearby Phe” side chain in 
helix G (movie $2). These changes create space 
for a chain of water molecules (Wat404, Wat453, 
Wat454) observed as strong positive difference 
map peaks [F,(10 to 15 ms) — F,(dark) above 30] 
connecting Asp®° on the cytoplasmic side with 
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the retinal SB (Fig. 2). Additional experimental 
evidence for such waters originates from a com- 
bination of Fourier transform infrared spectros- 
copy, site-directed mutagenesis, and molecular 
dynamic simulations (27, 28). Our experimental 
results are consistent with molecular dynamic 
simulations showing that three waters are suf- 
ficient to close the gap and enable the proton 
transfer step to the SB (28). The positions of the 
three water molecules differ from those of the 
four waters observed in a freeze-trapping struc- 
ture of the Val49Ala mutant (29), and the state 
trapped by mutagenesis lacks the large confor- 
mational changes on the cytoplasmic side. The 
constitutively open triple mutant Asp°°—>Gly/ 
Phe!”—Cys/Phe?°—+Leu, by contrast, allowed 
trapping of N-like features without light activa- 
tion (23). However, altering the side chains of 
these residues prevents formation of the water 
chain in the x-ray structure of the triple mutant 
(23) (fig. S5). Such controversies have long hin- 
dered the emergence of a consensus concerning 
large conformational changes in the late bR pho- 
tocycle (10). The useful mutations by their very 
nature had to target functionally critical regions, 
which complicated interpretation. Our study re- 
solves these discrepancies by showing the rele- 
vant arrangements in light-activated native bR 
measured at physiological temperatures and with 
real-time resolution. 


Molecular mechanism of proton uptake 


The structural transition fits well with the mo- 
lecular mechanism of vectorial proton pump- 
ing (movie $2) (30). Mechanistically, bR can be 
divided into an extracellular half and a cyto- 
plasmic half with retinal positioned at the cen- 
ter (Fig. 2). Light-induced isomerization of the 
retinal chromophore is the first principal step in 
the reaction and provides the energy for proton 
pumping. The energy is used in the second prin- 
cipal step to drive protein conformational changes 
on the extracellular side to transfer a proton from 
the retinal SB toward the extracellular release 
group via a hydrophilic interaction network of 
residues and water molecules. Transferring the 
proton to the release group removes the sub- 
strate from the center of the membrane. 

In the third principal step, the protein has to 
adopt a different conformation to allow the SB 
to accept a proton from the intracellular side of 
the membrane. Preventing proton back leakage 
to the cytoplasmic side is critical for vectorial 
transport against a concentration gradient. In 
bR, this is achieved through a hydrophobic bar- 
rier between the primary acceptor of protons in 
the center (the SB) and the proton donor on the 
cytoplasmic side of the membrane (Asp®’). This 
12-A barrier has to be opened at the right time 
during each cycle to allow the SB to accept a 
new proton and recharge the system. Such a 
reprotonation switch has been suspected to in- 
volve larger conformational changes of the pro- 
tein chain (37) in the late photocycle (i.e., M to N 
transition). Electron crystallography on bR mu- 
tants suggested opening on the cytoplasmic side 
and formation of a water channel to the retinal 
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Helix F 
Helix G 


Fig. 3. Opening of the cytoplasmic side facilitates reprotonation. Rearrangements of helices E, 
F, and G between the closed state (0 to 5 ms) (A) and the open state (10 to 15 ms) (B) lead to 
disordering of the C terminus (green dotted spheres). Charged residues on the cytoplasmic surface 
are shown as sticks. The presumed proton position is indicated on either Asp?° or the SB, and the 
chain of ordered water molecules forming at 10 to 15 ms is shown in spheres. 


(24, 32). The formation of a water chain, oriented 
by the protein environment, could in principal 
act as a Grotthuss proton wire (33), bridging 
the hydrophobic barrier to deliver a new proton 
to the SB. Yet, despite numerous molecular dy- 
namics simulations and structural studies (10), 
it remained unclear whether proton transfer 
occurs via rapidly exchanging waters in a chan- 
nel (23) or via distinct water molecules re- 
arranged by the protein environment (28). In 
addition to differences in methodology, dis- 
crepancies originate from the use of different 
starting models (fig. S5). The transition from 
the closed to the open bR conformation domi- 
nating the 10- to 15-ms timepoint provides direct 
evidence for the transient formation of such a 
chain of water molecules (Fig. 3). The rotation 
of Leu®’, together with Phe”, creates space for 
three water molecules that connect the SB with 
the intracellular proton donor Asp®°, opening 
the hydrophobic barrier by altering the hydro- 
phobic interaction network. Transferring the 
proton to the SB replenishes the substrate at 
the center of the membrane. 

The structural rearrangements that we ob- 
serve leave Asp®® covered by a single hydropho- 
bic lid formed by Phe*”, making it accessible for 
reprotonation. Furthermore, structural changes 
to a cluster of residues acting as a proton funnel 
at the cytoplasmic side of the protein aid re- 
protonation of Asp? (Fig. 3) (34), which may 


transiently proceed via an intracellular uptake 
cluster (35). 

Time-resolved serial crystallography has re- 
vealed the structural reorganizations during 
proton uptake and release in bR with astound- 
ing detail and over many orders of magnitude 
in time. The alternating rearrangements on the 
extracellular and intracellular sides during the 
pumping cycle of bR agree with an alternate 
access model and may provide a template for 
understanding the principal transport steps in 
other membrane transport proteins. The large- 
scale motion of helix F further resembles the 
outward movement of transmembrane helix 6 
in G protein-coupled receptors. Bringing time- 
resolved serial crystallography developed at 
XFELs to the broader scientific community using 
synchrotrons should provide tools to answer many 
other pressing questions in structural biology. 
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Conserved N-terminal cysteine 
dioxygenases transduce responses 
to hypoxia in animals and plants 


Norma Masson", Thomas P. Keeley’*, Beatrice Giuntoli”**, Mark D. White**+, 
Mikel Lavilla Puerta®, Pierdomenico Perata®, Richard J. Hopkinson‘t, 
Emily Flashman‘*§, Francesco Licausi”?§, Peter J. Ratcliffe’ | | 


Organisms must respond to hypoxia to preserve oxygen homeostasis. We identify 

a thiol oxidase, previously assigned as cysteamine (2-aminoethanethiol) dioxygenase 
(ADO), as a low oxygen affinity (high-K,,O2) amino-terminal cysteine dioxygenase 
that transduces the oxygen-regulated stability of proteins by the N-degron pathway 
in human cells. ADO catalyzes the conversion of amino-terminal cysteine to cysteine 
sulfinic acid and is related to the plant cysteine oxidases that mediate responses 

to hypoxia by an identical posttranslational modification. We show in human cells 
that ADO regulates RGS4/5 (regulator of G protein signaling) N-degron substrates, 
modulates G protein-coupled calcium ion signals and mitogen-activated protein 
kinase activity, and that its activity extends to other N-cysteine proteins including 
the angiogenic cytokine interleukin-32. Identification of a conserved enzymatic 
oxygen sensor in multicellular eukaryotes opens routes to better understanding and 
therapeutic targeting of adaptive responses to hypoxia. 


xygen homeostasis is critical for most 
forms of life and is impaired in most hu- 
man diseases. Previous work identified the 
hypoxia-inducible factor (HIF) prolyl hy- 
droxylases as human oxygen sensors (1). 
These enzymes are low oxygen affinity (high- 
Ky,O.), 2-oxoglutarate (2-OG)-dependent dioxy- 
genases that catalyze trans-4-prolyl hydroxylation 
of the transcription factor HIF (2, 3) to target it 
for proteolysis. By this process, these hydroxylases 
regulate a broad range of transcriptional responses 
to hypoxia [reviewed in (4)]. Although the prolyl 
hydroxylation of HIF was unprecedented as a 
signaling mechanism, subsequent work has re- 
vealed different systems of enzymatic protein 
oxidation, which signal hypoxia in representa- 
tives of all four eukaryotic kingdoms (5-7). In 
each system, the protein oxidation event is linked 
to protein degradation. 
Of particular interest is the Cys branch of the 
N-degron pathway (8). After the action of me- 
thionine aminopeptidases, oxidation of N-terminal 
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cysteines creates a substrate for arginyltrans- 
ferases, which catalyze the addition of this 
N-terminal destabilizing residue, promoting deg- 
radation. No cysteine-modifying enzyme was 
defined, but N-terminal cysteine oxidation was 
shown to be affected by nitric oxide (9) and, on 
the basis of in vitro studies, has been considered 
likely to be nonenzymatic. Subsequently, it was 
shown in plants that the Cys branch of the N-degron 
pathway controls the stability of ethylene response 
transcription factors (ERF-VII) (0, 17). Further 
studies in Arabidopsis thaliana revealed that cys- 
teine oxidation is catalyzed by a series of plant 
cysteine oxidases (PCOs), which act as oxygen sen- 
sors directing hypoxic adaptation (7, 12). These 
findings led us to further investigate the mech- 
anism of N-terminal cysteine oxidation in animals. 

First, we created human osteosarcoma U-20S 
and colon cancer RKO cells that stably express a 
fusion protein comprising N-terminal sequences 
that are sufficient for oxygen-dependent deg- 
radation of the ERF-VII transcription factor 
RAP2.12 (Related to APETALA2) in plants, linked 
to a green fluorescent protein (GFP):V5 reporter, 
and exposed these cells to hypoxia. To distinguish 
responses from those transduced by HIF, we also 
tested known inhibitors of the HIF prolyl hy- 
droxylases that differ in their specificity both 
for other iron-dependent dioxygenases and non- 
enzymatic, metal-catalyzed oxidation. Exposure 
of the transfected cells to hypoxia and to the 
nonspecific iron chelator dipyridyl resulted in 
accumulation of the RAP,59:GFP:V5 reporter 
protein, but not that of a C2A mutant, without 


affecting reporter transcript levels (fig. S1). 
By contrast, neither reporter was activated by 
nonspecific 2-OG dioxygenase inhibitors [desfer- 
rioxamine (DFO) and dimethyloxalylglycine 
(DMOG)] or a HIF prolyl hydroxylase inhibitor, 
all of which robustly induced HIF (Fig. 1A). In 
cells exposed to hypoxia for 16 hours and then 
treated with cycloheximide before being reoxy- 
genated or maintained in hypoxia, we found that 
hypoxia prolonged the reporter protein half-life 
from ~5 to 35 min (Fig. 1B and fig. S2). These 
findings demonstrated an iron- and oxygen- 
dependent activity in human cells that is dis- 
tinct from that of the HIF prolyl hydroxylases 
and that operates on amino-acid sequences from 
plant RAP2.12, in a manner dependent on cysteine 
at position 2. 

We next compared this response with that of 
members of the R4 group of RGS proteins, which 
are targets of the Cys branch of the N-degron 
pathway in humans and mice (13, 14). Experi- 
ments on RKO cells stably expressing hemag- 
glutinin (HA)-tagged RGS4 (RGS4:HA) and an 
RGS4:GFP fusion (RGS4;.29:GFP), each encoding 
wild-type or C2A mutant sequences, revealed ac- 
cumulation of wild-type, but not mutant, pro- 
teins in cells exposed to hypoxia and dipyridyl, 
but not DMOG or DFO (Fig. 1C and fig. S3). En- 
dogenous RGS4 and RGS5 proteins in human 
neuroblastoma SH-SY5Y cells responded iden- 
tically to the same set of compounds (Fig. 1D). 
Responses of these RGS proteins to graded hy- 
poxia were further examined in a series of human 
(SH-5Y5Y, RKO, human endothelial EA.hy926) 
and mouse embryonic sarcoma (C3H/10T1/2) cells, 
revealing progressive accumulation of RGS4 or 
RGSS proteins in response to physiological hypoxia 
(Fig. IE and fig. S4). These changes were observed 
at the level of proteins and not mRNAs, with the 
exception of RGS4 in SH-SY5Y. Thus, plant and 
human reporter proteins and endogenous RGS 
proteins appeared to be regulated similarly, sug- 
gesting that human cells might regulate their sta- 
bility using an enzyme(s) similar to the PCOs. 

The human genome contains two thiol di- 
oxygenases with a predicted structure similar 
to that of the PCOs, cysteine dioxygenase (CDO1) 
and an enzyme previously assigned as cysteamine 
(2-aminoethanethiol) dioxygenase (ADO) (15, 16). 
Genes encoding these enzymes and PCO1 were 
cotransfected with a gene encoding RGS4:HA 
into human embryonic kidney 293T cells. Over- 
expressed ADO, but not CDO1, suppressed 
hypoxic induction of RGS4:HA in a manner 
dependent on cysteine at position 2 (Fig. 2A). 
At these levels of overexpression, RGS4:HA was 
not suppressed by PCO1. The ability of ADO to 
suppress RGS4:HA was inhibited by combined 
exposure to hypoxia and dipyridyl and ablated 
by H112A+H114A mutations (fig. $5) that prevent 
assembly of the catalytic iron center (15). These 
experiments indicated that the catalytic activity 
of overexpressed ADO was sufficient to suppress 
RGS4:HA. 

We next inactivated ADO and CDO1 in SH-SY5Y 
and RKO cells using CRISPR/Cas9-mediated gene 
editing. Inactivation of ADO but not CDO1 led 
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to constitutive up-regulation of endogenous and 
transfected RGS4 and RGSS proteins irrespective 
of oxygen levels (Fig. 2B and figs. S6 and $7). In 
view of reported actions of nitric oxide on RGS 
proteins (9, 17), we also tested responses to the 
nitric oxide donor DETA-NO in wild-type and 


Fig. 1. Regulation of plant and animal N-degron 
substrates by oxygen in human cells. (A) Levels 
of fusion proteins linking the N-terminal 1-50 
residues of plant RAP2.12 or a C2A mutant to a 
GFP:V5 cassette [RAP .50:V5; RAP}-50(C2A):V5] in 
stably transfected U-20S cells exposed to hypoxia 
or the indicated inhibitors. (B) RAP -50:V5 reporter 
protein half-life in cells incubated in hypoxia 

(16 hours, 1% Oz), treated with cycloheximide 
(100 uM, 10 min), and then maintained in hypoxia 
or reoxygenated for the indicated times. 

(C) C-terminal HA-tagged human RGS4 (RGS4:HA) 
or a C2A mutant in stably transfected RKO cells 
exposed to hypoxia or inhibitors. (D and E) Endog- 
enous RGS4 and RGS5 proteins in SH-SY5Y cells 
exposed to inhibitors (D) or graded hypoxia (E). 
Similar patterns of response were observed for the 
plant fusion-protein reporter, transfected RGS4:HA, 
and endogenous RGS4/5 proteins; responses of 


exogenous proteins were abolished by C2A mutation. 2,2 DIP, 2,2-dipyridyl (1 
(1 mM); PHI, prolyl hydroxylase inhibitor (125 uM); MG132, proteasomal inhibitor (25 uM). All ex 


unless otherwise indicated. HIF-lo immunoblots are 


ADO-deficient cells. Suppression of hypoxic RGS4: 
levels by DETA-NO in wild-type SH-SY5Y and 
RKO cells was also abrogated in ADO-deficient 
cells (fig. S8). Stable reexpression of ADO, but 
not overexpression of CDO1, suppressed levels 
of RGS proteins (figs. S7 and S9). Under these 


conditions, expression of PCO1 also suppressed 
RGS proteins and restored regulation by oxygen, 
demonstrating oxygen-dependent activity of the 
plant enzyme on endogenous human proteins. In 
SH-SY5Y cells, we then inactivated the arginyl- 
transferase ATE-1 that operates downstream 
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00 uM); DFO, desferrioxamine (100 uM); DMOG, dimethyloxalylglycine 
posures of cells to hypoxia or inhibitors were for 4 hours 


Fig. 2. ADO controls the oxygen-dependent 
Cys branch of the N-degron pathway. 

(A) RGS4:HA and RGS4(C2A):HA protein levels 
in 293T cells after coexpression with control 
(EV), ADO, CDO1, or PCO1. Cells were exposed to 
hypoxia (0.5% Oz, 16 hours) or maintained in air. 
Comparable enzyme levels were confirmed by 
separate FLAG immunoblotting. (B) Endogenous 
RGS4 and RGS5 proteins in ADO-deficient 
SH-SY5Y cells (ADO KO); RGS4 and RGS5 are 
constitutive and insensitive to iron chelators 

or hypoxia. (C) Overexpression of ADO does not 
repress constitutive stabilization of RGS5 in 
ATE1-deficient (ATE1 KO) cells. (D) Expression of 
human ADO restores the wild-type phenotype 

in 4pco A. thaliana mutants; 3pco mutants that 
did not manifest this phenotype were unaffected 
by ADO. (E) Box plots showing the relative 
mRNA level of hypoxia-inducible genes in wild- 
type and pco mutant plants that express ADO. 
ADO significantly reduced expression of 

the hypoxia-inducible genes PDC1, ADH, LBD41, 
and SAD6 in 4pco mutants. Data are shown as 
mean + standard deviation (SD). *P < 0.05; 
Mann-Whitney rank-sum test with levels of non- 
hypoxia-inducible genes unchanged (fig. S10). 
(F) Relative luciferase activity (Fluc/Rluc) in 
Saccharomyces cerevisiae cells expressing 
C-DLOR (Cys) or A-DLOR (Cys to Ala mutant) 
reporter under aerobic and hypoxic conditions in 
the presence or absence of human ADO or 
CDO1. Data are shown as mean + SD. *P < 0.05; 
two-way analysis of variance (ANOVA) followed 
by Holm-Sidak post hoc test. 
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of the proposed cysteine oxidation in the N-degron 
pathway. Up-regulation of RGS5 in ATE1- and 
ADO-deficient cells was similar, and was not 
suppressed by overexpression of ADO in ATE1- 
deficient cells (Fig. 2C). Thus, ADO is required 
for oxygen-dependent degradation of RGS 
proteins. This activity was dependent on the 
integrity of ATE1, consistent with ADO acting 
upstream of ATE] in the N-degron pathway. 

To determine whether ADO can complement 
deficient PCO in plants, we generated a PCO- 
depleted A. thaliana mutant by crossing plants 
in which four of the five known PCO (1, 2, 4, and 
5) genes were inactivated by transferred DNA 
insertional mutagenesis. Homozygous quadruple 
pcol/2/4/5 (4pco), but not triple pcol/2/4 (3pco), 
mutant plants manifested severe developmental 
defects (Fig. 2D) and up-regulation of hypoxia- 
responsive genes under aerobic conditions (Fig. 
2E). When human ADO, but not CDO1, was intro- 
duced into 4pco plants under control of the PCO1 
promoter, the constitutive upregulation of anaer- 
obic genes in air was corrected and the plants 
developed normally (Fig. 2, D and E, and fig. S10). 
Consistent with complementation of defective 
PCO function, coexpression of ADO in A. thaliana 
protoplasts caused dose-dependent suppression 
of RAP2.12 luciferase fusion protein activity 
(fig. S10). 

In budding yeast, cysteine is not a destabilizing 
N-terminal residue (J8). To determine whether 
deficiency of an ADO-like enzyme might be 
responsible for the stability associated with 
N-cysteine in yeast, we introduced ADO or CDO1 
into yeast cells together with a ratiometric re- 
porter in which the activity of a RAP2.12-firefly 
luciferase fusion protein or a C2A mutant is nor- 
malized to Renilla luciferase (fig. S11). Expres- 
sion of human ADO, but not CDO1, reduced the 
activity of, and conferred hypoxic regulation 
on, the RAP»-.s-FLuc protein, but not a C2A 
mutant (Fig. 2F and fig. S11). Consistent with 
this, phylogenetic analyses revealed poten- 
tial ADO and CDO orthologs across most plants, 
animals, and at least some protist species but 
not fungi (fig. S12). Together, these findings ex- 
plain the lack of activity of cysteine as a destabi- 
lizing residue in yeast but suggest that the 
pathway might otherwise operate widely in 
eukaryotic species. 

Cross-complementation suggested that ADO 
catalyzes a form of N-terminal cysteine dioxygen- 
ation similar to that catalyzed by the PCOs. To 
test this, we produced recombinant human ADO 
and CDO1 in E. coli, reacted these enzymes with 
synthetic peptides corresponding to residues 2 to 
15 of human RGS4 and RGSS, and examined the 
products by mass spectrometry (MS). We found 
that the peptides were modified by +32 Da mass 
addition by ADO, but not CDO1 (Fig. 3A and 
fig. S13), and that this modification was sup- 
pressed in anaerobic conditions (Fig. 3B). To 
confirm dioxygenation, we conducted the reac- 
tions in an atmosphere of 80, or in the presence 
of '80-labeled water (H.'*O). These experiments 
revealed a single +36 Da mass addition in 0, 
and a single +32 Da mass addition in the pres- 
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ence of '8O-labeled water, demonstrating that 
two oxygen atoms were incorporated directly 
into the peptide from molecular oxygen and con- 
firming dioxygenation (Fig. 3B). MS2 assigned 
the modification to the N-terminal cysteine 
(fig. S14). Thus, human ADO catalyzes the di- 
oxygenation of N-cysteine residues in RGS4 
and RGS5 to cysteine sulfinic acid. Kinetic mea- 
surements on human ADO (Fig. 3C and fig. S15) 
revealed high turnover (X,at) values of 20.1 and 
16.9 s ' on RGS4 and RGS5 peptides under at- 
mospheric conditions, respectively, but marked 
sensitivity to oxygen (apparent K,,O, > 500 uM). 
Thus, ADO resembles the HIF prolyl hydroxy- 
lase enzymes in manifesting a K,,O, that is sig- 
nificantly above the physiological range, a 
property that may underpin a role in oxygen 
homeostasis. Given the original assignment of 
ADO as a ccysteamine dioxygenase, we also exam- 
ined competition of N-cysteine peptide dioxy- 
genation by free cysteamine and cysteine, but 
found inhibition only at high concentrations 
of these metabolites (ICs9, 37 and 13 mM, re- 
spectively, fig. S16). 


A 
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KO2"PP(%) 55.1 + 6.6 47.1+5.01 


RGS4 and RGS5 regulate heterotrimeric 
G protein signaling by enhancing Ga-coupled 
guanosine 5’-triphosphate hydrolysis and hence 
attenuating G protein signals. Because the cat- 
alytic activity of ADO lowers the levels of RGS4 
and RGS5, ADO-deficient cells in which levels 
of these proteins are increased should manifest 
attenuation of G protein signaling on relevant 
pathways. Go proteins can regulate the activity 
of mitogen-activated protein kinase (MAPK) 
pathways (14, 19). Consistent with this, mouse 
cells and embryos with a defective N-degron 
pathway due to loss of the arginyltransferase 
ATE1 have been shown to exhibit reduced ac- 
tivation of MAPK kinase (J4). We therefore as- 
sayed phosphorylation of MAPK (p44/p42) in 
ADO-deficient SH-SY5Y cells (Fig. 4A). These 
experiments revealed reduced levels of phospho- 
rylated MAPK in ADO-deficient SH-SY5Y cells 
that were similar to levels of phosphorylated 
p44/p42 in ATE1-deficient SH-SY5Y cells. Reduc- 
tion in phosphorylated p44/p42 was reversed by 
expression of ADO in ADO-knockout (KO), but 
not ATE1-KO, cells (Fig. 4B). To test the effects 
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Fig. 3. ADO catalyzes the dioxygenation of the N-terminal Cys of RGS4/5 peptides. (A) MS 
results showing a mass shift of +32 Da when RGS5 N-terminal peptide was incubated with 
recombinant human ADO, but not with recombinant human CDO1. Similar results were obtained 
when an RGS4 N-terminal peptide was used (fig. S13). (B) The ADO-catalyzed +32 Da mass addition 
is absent when reactions were conducted under anaerobic (100% N>) conditions; !°O labeling 
demonstrates incorporation of two oxygen atoms derived directly from molecular O2 and not H20. 
(C) Summary table of reaction kinetics for ADO-catalyzed dioxygenation of N-terminal RGS4/5 
peptides. The influence of varying peptide concentration under atmospheric conditions (keatPep and 
K,,Pep??) and Oz levels using a fixed, nonlimiting concentration of peptide (kga,O2 and K,,027P?) 
were examined to determine sensitivities to both substrates. m/z, mass-to-charge ratio. Source data 


are provided in fig. S15. 
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of ADO on responses to a specific G protein- 
coupled agonist, we examined carbachol, a cho- 
linergic agonist whose muscarinic receptor is 
coupled via Gaq to the regulation of intracel- 


lular Ca?*. Attenuation of Ca?* mobilization in 
response to carbachol (Fig. 4C), but not to the 
receptor-independent ionophore ionomycin 
(Fig. 4D), was observed in ADO-deficient cells 
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Fig. 4. ADO regulates G protein signaling. ADO regulates G protein signaling in SH-SY5Y cells. 
(A) MAPK (p44/42) phosphorylation in wild-type, ADO KO, and ATE1 KO cells. Immunoblot lanes 
represent separate biological replicates, with densitometric analysis provided below. Data are shown 
as mean + SD. n = 3 independent clones. ***P < 0.001, one-way ANOVA with Holm-Sidak post hoc 

test. (B) Reexpression of ADO increases phosphorylated p44/42 in ADO KO, but not ATE1 KO cells. Data 
are shown as mean + SD. n = 3 independent experiments. **P < 0.01, two-way ANOVA with Holm-Sidak 
post hoc test. (C) Carbachol (CCh)-stimulated rises in [Ca**], are attenuated in ADO-deficient (KO) 
compared with wild-type (WT) cells. A representative trace is provided and mean peak change in 
R405/495 intensity at each CCh concentration is shown (inset). n = 8 to 12 individual cells. ***P < 0.001, 
three-parameter nonlinear regression analysis. (D) lonomycin (0.1 uM) is equipotent at stimulating 
Ca** release in ADO KO cells infected with either control (EV) or ADO-containing lentivirus, 
whereas responses to CCh are recovered by ADO reexpression. Data are shown as mean + SD. 

n = 6 to 7 individual cells. *P < 0.05, two-way ANOVA with Holm-Sidak post hoc test. 


Fig. 5. IL-32 is a target of ADO-catalyzed A 
N-terminal cysteine dioxygenation. (A) MS 

analyses of the indicated IL-32 N-terminal peptide 
incubated aerobically with or without recombinant 

ADO (1 hour at 37°C), showing a +32 Da shift 

when incubated with ADO, indicative of the addition 0 
of Oo. The small peak at ~845 m/z in the absence 

of ADO (top panel) was confirmed to correspond to 100 
a potassium adduct of the unoxidized peptide. 

(B) IL-32, but not asparagine synthetase (ASNS) or 
JunB, is regulated by hypoxia (1% Oz, 4 hours) and 
ADO in RKO cells. (©) ADO-dependent regulation 
of IL-32 by hypoxia is observed at the protein but not 
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and was reversed by reexpression of ADO. Given 
the complexity of interactions among RGS pro- 
teins and G protein-signaling pathways, we can- 
not be certain that these effects are entirely 
caused by effects of ADO on RGS4 and RGS5 
proteins. Nevertheless, this study establishes 
a role for ADO in the regulation of G protein 
signaling, consistent with its role as a cysteine- 
modifying enzyme in the N-degron pathway 
regulating RGS proteins. 

N-terminal sequence analyses of proteins en- 
coded by plant and animal genomes have sug- 
gested the existence of many other potential 
substrates for the Cys branch of the N-degron 
pathway (7, 9), and ADO is more widely ex- 
pressed in human cells and tissues than is RGS4/5 
(20). We therefore sought to determine whether 
ADO-mediated, oxygen-dependent regulation of 
human proteins extended beyond the identified 
RGS proteins. To pursue this, we first reacted 
recombinant ADO with a diverse series of pep- 
tides derived from proteins predicted to be pro- 
cessed to generate N-cysteine polypeptides and 
then measured dioxygenation (+32 Da mass shift) 
by MS. These experiments revealed substrate- 
dependent catalytic activity of ADO ranging from 
near zero to levels that were similar to those in 
experiments using RGS5 peptide (figs. S17 and 
$18). We then examined endogenous protein lev- 
els corresponding to peptide substrates that sup- 
ported high [interleukin-32 (IL-32), Fig. 5A] or 
very low [asparagine synthetase (ASNS) and 
JunB] ADO-catalyzed dioxygenation using avail- 
able antibodies. These experiments were conducted 
in the previously engineered ADO-deficient RKO 
cells because IL-32 was not detected in SH-SY5Y 
cells. Immunoblotting revealed that the abun- 
dance of IL-32, but not ASNS or JunB, was 
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increased in hypoxic cells, accumulated constitu- 
tively in ADO-deficient cells, and was reduced by 
reexpression of transfected ADO (Fig. 5B). Exper- 
iments in wild-type and ADO-deficient RKO 
cells confirmed IL-32 regulation at the pro- 
tein but not the mRNA level (Fig. 5C). Further 
experiments confirmed dioxygenation of the 
N-terminal cysteine after reaction of the IL-32 
peptide with recombinant ADO (fig. $18) and 
showed that this residue was necessary for ADO- 
mediated suppression of cotransfected IL-32 in 
cells (Fig. 5D). These findings demonstrate that 
ADO target proteins do extend beyond RGS pro- 
teins and identify human IL-32 as one such pro- 
tein. The tested peptides represent only a small 
fraction of N-cysteine polypeptides that might be 
generated in cells and it is therefore likely that 
other ADO-regulated human targets exist. The 
transcriptional regulator LITTLE ZIPPER 2 and 
a component of Polycomb Repressor 2 Complex, 
VERNALIZATION 2, have recently been identified 
as new oxygen-regulated targets of the Cys branch 
of the N-degron pathway in plants (27, 22). Given 
the emerging complexity of N-degron regulation 
(8), in which other processes may compete with 
ADO-catalyzed dioxygenation, it cannot be as- 
sumed that high levels of ADO catalysis on 
isolated peptides will necessarily predict phys- 
iological regulation by ADO, or indeed that all 
protein regulation by ADO operates through the 
same downstream pathways. Identification of 
human ADO as an enzymatic human oxygen 
sensor should open the way to understanding 
responses to hypoxia that are transduced by 
these pathways. 

Conservation of ADO and the PCOs as human 
and plant oxygen sensors contrasts with the ab- 
sence of conservation of their known substrates 
and with different challenges to oxygen homeosta- 
sis that are encountered by animals and plants. 
In plants, the ERF-VII pathway directs tran- 
scriptional responses to hypoxia that require 
time for the transcriptional output to engage 
adaptive responses. In animal cells, the prin- 
cipal process regulating transcriptional responses 
to hypoxia is the prolyl hydroxylation of HIF. By 
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contrast, direct operation of ADO on the protein 
stability of signaling molecules has the potential 
to transduce more rapid responses to hypoxia 
than those mediated by the transcriptional out- 
put of HIF. RGS4 and RGS5 have been impli- 
cated in oxygen homeostasis in mammals through 
effects on the cardiovascular system and angio- 
genesis (13, 19, 23). IL-32 is an atypical cyto- 
kine that regulates proinflammatory cytokine 
networks and angiogenic growth factors (24, 25). 
Although our findings identify ADO as an essen- 
tial regulator of the responses of these proteins 
to hypoxia, it is of interest that RGS4, RGS5, and 
IL-32 have all been reported to be transcriptional 
targets of HIF (26-28). Consistent with the re- 
ported cell-type-specific regulation of RGS4 by 
HIF (26), we did observe induction of RGS4 
mRNA by hypoxia in SH-SY5Y cells, but not in 
other cells. These findings predict that in spe- 
cific cellular settings in which both systems are 
operative, the ADO and HIF prolyl hydroxylase 
systems will interact to generate physiological 
responses to hypoxia. In conclusion, our work 
defines an enzymatic human oxygen sensor, most 
likely operating physiologically on a shorter time 
scale than the transcriptional responses trans- 
duced by the HIF prolyl hydroxylases, and opens 
a new route to the investigation of adaptive re- 
sponses to hypoxia, potentially including their 
therapeutic augmentation by catalytic inhib- 
itors of ADO. 
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Civic honesty around the globe 


Alain Cohn™, Michel André Maréchal®*, David Tannenbaum’, Christian Lukas Ziind? 


Civic honesty is essential to social capital and economic development but is often in 
conflict with material self-interest. We examine the trade-off between honesty and 
self-interest using field experiments in 355 cities spanning 40 countries around the 
globe. In these experiments, we turned in more than 17,000 lost wallets containing varying 
amounts of money at public and private institutions and measured whether recipients 
contacted the owners to return the wallets. In virtually all countries, citizens were more likely 
to return wallets that contained more money. Neither nonexperts nor professional economists 
were able to predict this result. Additional data suggest that our main findings can be explained 
by a combination of altruistic concerns and an aversion to viewing oneself as a thief, both of 
which increase with the material benefits of dishonesty. 


onest behavior is a central feature of eco- 

nomic and social life (7, 2). Without hon- 

esty, promises are broken, contracts go 

unenforced, taxes remain unpaid, and gov- 

ernments become corrupt. Such breaches 
of honesty are costly to individuals, organiza- 
tions, and entire societies. For example, losses 
due to tax evasion in the United States are esti- 
mated in the hundreds of billions of dollars each 
year (3), and the cost of corruption and other illicit 
financial flows in developing countries has been 
estimated at up to US$1.3 trillion annually—an 
amount roughly equal in size to the gross domes- 
tic product of Australia (4, 5). 

In this Report, we examine how acts of civic 
honesty, where people voluntarily refrain from 
opportunistic behavior, are affected by monetary 
incentives to act otherwise. Although there is 
robust experimental literature on the conditions 
that give rise to honest behavior (6-11), little is 
known about how material incentives affect 
civic honesty, particularly in field settings. 
Understanding the relationship between civic 
honesty and material incentives is not only prac- 
tically relevant but also theoretically important. 

Theories of honesty make different predic- 
tions about the role of material incentives. Classic 
economic models based on rational self-interest 
suggest that, all else being equal, honest behavior 
will become less common as the material incen- 
tives for dishonesty increase (12). Models of 
human behavior that incorporate altruistic or 
other-regarding preferences also predict that 
dishonesty will rise with increasing incentives, as 
self-interest virtually always dominates over con- 
cerns for the welfare of others—we care about 
others but not as much as we care about our- 
selves (13-15). As a result, self-interest will play 
an increasingly prominent role in behavior as 
the material incentives for dishonesty grow. 
Psychological models based on self-image main- 
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tenance predict that people will cheat for profit 
so long as their behavior does not require them 
to negatively update their self-concept (7, 16). 
However, it is unclear ex ante whether self-image 
concerns will become more or less important 
as the incentives for dishonesty increase and 
also what form that relationship will take. A fur- 
ther complication is that most of the experimen- 
tal literature on honest behavior involves modest 
financial stakes, has been conducted in labora- 
tory settings (where people understand their be- 
havior is being observed), and tends to rely on 


populations from Western, educated, industri- 
alized, rich, and democratic societies (17). 

We conducted a series of large-scale field ex- 
periments across the globe to examine how fi- 
nancial incentives influence rates of civic honesty. 
We turned in “lost” wallets and experimentally 
varied the amount of money left in them, which 
allowed us to determine how monetary stakes 
affect return rates across a broad sample of so- 
cieties and institutions. Our experiments take 
inspiration from classic “lost letter” studies, which 
examine behavior in naturalistic settings, but 
provide tighter experimental control than past 
studies (78, 19). 

We visited 355 cities in 40 countries and turned 
in a total of 17,303 wallets. We typically targeted 
five to eight of the largest cities in a country, with 
roughly 400 observations per country. Wallets 
were turned in to one of five types of societal 
institutions: (i) banks; (ii) theaters, museums, or 
other cultural establishments; (iii) post offices; 
(iv) hotels; and (v) police stations, courts of law, 
or other public offices. These institutions serve as 
useful benchmarks because they are common 
across countries and typically have a public recep- 
tion area where we could perform the drop-offs. 

Our wallets were transparent business card 
cases, which we used to ensure that recipients 
could visually inspect without having to phys- 
ically open the wallet (fig. S1). Our key indepen- 
dent variable was whether the wallet contained 
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Fig. 1. Share of wallets reported in the NoMoney and Money conditions, by country. (Left) Share 
of wallets reported in NoMoney (US$0) and Money (US$13.45) conditions, by country. The amount 
of money in the wallet is adjusted according to each country’s purchasing power. (Right) Average 
difference between Money and NoMoney conditions across quartiles based on absolute reporting 
rates in the NoMoney condition. Error bars represent standard errors of the mean. 
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money, which we randomly varied to hold either 
no money or US$13.45 (“NoMoney” and “Money” 
conditions, respectively). We used local curren- 
cies and, to ensure comparability across coun- 
tries, adjusted the amount according to each 
country’s purchasing power. Each wallet also 
contained three identical business cards, a gro- 
cery list, and a key. The business cards displayed 
the owner’s name and email address and we 
used fictitious but commonplace male names for 
each country. Both the grocery list and business 
cards were written in the country’s local language 
to signal that the owner was a resident. 

After walking into the building, one of our re- 
search assistants (from a pool of 11 male and 2 
female assistants) approached an employee at the 
counter and said, “Hi, I found this [pointing to 
the wallet] on the street around the corner.” The 
research assistant then placed the wallet on the 
counter and pushed it over to the employee, say- 
ing, “Somebody must have lost it. ’m in a hurry 
and have to go. Can you please take care of it?” 
The assistant then exited the building without 
leaving contact details or requesting written proof 
of having turned in the wallet. Our key outcome 
measure was whether recipients contacted the 
owner to return the wallet. We created a unique 
email address for each wallet and recorded emails 
that were sent within 100 days of the initial drop- 
off. Complete methods and results, including 
additional robustness checks such as testing for 
experimenter effects, can be found in the supple- 
mentary materials. 

As shown in the left panel of Fig. 1, our cross- 
country experiments return a remarkably con- 
sistent result: citizens were overwhelmingly more 
likely to report lost wallets containing money 
than those without. We observed this pattern for 
38 of our 40 countries, and in no country did we 
find a statistically significant decrease in report- 
ing rates when the wallet contained money. On 
average, adding money to the wallet increased 
the likelihood of being reported from 40% in 
the NoMoney condition to 51% in the Money 
condition (P < 0.0001). This result holds when 
controlling for a number of recipient and situa- 
tional characteristics (table S8). Furthermore, 
although rates of civic honesty vary substantially 
from country to country, the absolute increase in 
honesty across conditions was stable. As shown 
in the right panel of Fig. 1, the average treatment 
effect is roughly equal in size across quartiles 
based on absolute reporting rates. 

Citizens displayed greater civic honesty when 
the wallets contained money, but perhaps this is 
because the amount was not large enough to be 
financially meaningful. To examine this possibil- 
ity, we also ran a “BigMoney” condition in three 
countries (the United States, the United Kingdom, 
and Poland) that increased the money inside the 
wallet to US$94.15, or seven times the amount in 
our original Money condition. As shown in Fig. 2, 
reporting rates in all three countries increase 
even further when the wallets contained a sizable 
amount of money. Pooled across the three coun- 
tries, reporting rates increased from 46% in the 
NoMoney condition to 61% in the Money condition 
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and topped out at 72% in the BigMoney condition 
(P < 0.0001 for all pairwise comparisons) (table S9). 

We next turn to the question of why people are 
especially likely to return a lost wallet when it 
contains more, rather than less, money. Our study 
design allows us to rule out several possible ex- 
planations. We first explored the possibility that 
recipients were worried about legal penalties for 
failing to return a wallet, especially when the wal- 
let contained larger amounts of money. To ad- 
dress this issue, we examined whether relative 
reporting rates were affected by (i) the presence 
of other individuals when receiving the lost wal- 
let, (ii) the presence of security cameras in the 
building, and (iii) state-level variation in lost 
property laws in the United States. Civic honesty 
should increase as a function of these variables if 
recipients are concerned about possible punish- 
ment or the probability of detection, yet we find 
that none of these factors explain meaningful 
variation in reporting rates across treatment con- 
ditions (tables S14 to S16). A second explanation 
is that because we only measured whether recip- 
ients reported a lost wallet, recipients in the money 
conditions may have been more likely to return 
the wallets while pocketing the cash. We conduc- 
ted an audit on a subset of wallets reported to us 
and did not find support for this explanation: more 
than 98% of the money in the wallets we collected 
was returned. A third possible explanation is that 
recipients expected a bigger finder’s fee upon re- 
turning wallets with larger amounts of money. In 
national representative surveys conducted in the 
United States, the United Kingdom, and Poland, 
we asked respondents what size of reward they 
would expect upon returning a wallet with the 
amounts of money we used in our studies. We 
fail to find evidence that people expect a larger 
reward for returning a wallet with more money 
in it (table S17). 

Having ruled out these three possible explan- 
ations, we next formulate and test a simple be- 
havioral model that captures the pattern of results 
observed in the data (full model details can be 
found in the supplementary materials). In our 
framework, civic honesty is determined by the 
interplay between four components: (i) the eco- 
nomic payoff of keeping the wallet, (ii) the fixed 
effort cost of contacting the wallet’s owner, (iii) 
an altruistic concern for the owner’s welfare, and 
(iv) the costs associated with negatively updating 
one’s self-image as a thief (what we call theft 
aversion). 

A key feature of our framework is that al- 
truistic concerns are affected by the contents of 
the wallet thought to be valuable to the owner, 
whereas concerns of theft aversion are only af- 
fected by the contents of the wallet that are also 
valuable to the recipient (e.g., money). To dis- 
tinguish between these two motivations, we 
conducted a “Money-NoKey” condition in our 
U.S., U.K., and Poland locations with wallets 
identical to our Money condition but which did 
not contain a key. Unlike money, the key is val- 
uable to the owner but not to the recipient, and 
so any difference between the Money and Money- 
NoKey conditions can be ascribed to altruistic 
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Fig. 2. Reporting rates as a function of 
monetary stakes. Share of wallets reported 
in the NoMoney (US$0), Money (US$13.45), 
and BigMoney (US$94.15) conditions. 


concerns. As shown in table S10, recipients were, 
on average, 9.2 percentage points more likely 
to report a wallet with a key than one without 
(P = 0.0001 when results are pooled across coun- 
tries). This suggests that recipients reported a 
lost wallet partly because they were concerned 
about the harm they would impose on the owner 
by not reporting it. 

The second part of our framework—which is 
crucial to explaining the increase in reporting 
rates for wallets with larger amounts of money— 
involves the aversion to viewing oneself as a thief. 
Using nationally representative surveys con- 
ducted in the United States, the United Kingdom, 
and Poland, we asked respondents to imagine 
receiving a wallet from one of our four condi- 
tions (NoMoney, Money, BigMoney, and Money- 
NoKey) and to rate the extent to which failing 
to return that wallet would feel like stealing on 
a scale from O (not at all) to 10 (very much). 
Respondents reported that failing to return a 
wallet would feel more like stealing when the 
wallet contained a modest amount of money than 
when it contained no money and that such be- 
havior would feel even more like stealing when 
the wallet contained a substantial amount of 
money (P < 0.007 for all pairwise comparisons) 
(table S11). This tells us that the self-image cost 
of failing to return the wallet likely increases 
with the amount of money in the wallet, which 
is consistent with our behavioral data on wallet 
reporting rates. By contrast, we fail to observe a 
reliable difference in “feels like stealing” scores 
when comparing wallets that contained the 
same amount of money but differed in whether 
they also contained a key (Money versus Money- 
NoKey; P = 0.259). This tells us that concerns of 
theft aversion are likely tied to contents that are 
valuable to the recipient, such as the amount of 
money inside the wallet, but not to other con- 
tents that are only valuable to the owner. Although 
survey responses do not always generalize to real 
behavior and should be interpreted carefully, 
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Fig. 3. Actual versus predicted reporting rates. (A) Actual reporting rates in the United States 
for each condition (n = 800 observations). Error bars represent robust standard errors. (B) Average 
predicted reporting rates for the United States by our nonexpert sample (n = 299 individuals). Error bars 
represent robust standard errors clustered by participants. (©) Average predicted reporting rates 

for the United States by our expert sample of academic economists (n = 279 individuals). Error bars 
represent robust standard errors clustered by participants. 


these findings are consistent with the hypoth- 
esis that larger monetary payoffs for dishonesty 
are also associated with increased psychological 
costs and that the increase in psychological costs 
can outweigh the marginal economic benefits of 
dishonesty. 

In a final set of studies, we investigated 
whether people anticipate this form of civic hon- 
esty. We asked a sample of 299 participants to 
predict reporting rates in the United States for 
wallets containing US$0, US$13.45, and US$94.15 
(corresponding to our NoMoney, Money, and 
BigMoney conditions). To encourage accuracy, 
we notified respondents that the most accurate 
among them would be awarded a cash bonus. 
As shown in Fig. 3B, we find that respondents’ 
beliefs were at odds with the behavioral data 
(Fig. 3A). Respondents predicted that rates of 
civic honesty would be highest when the wallet 
contained no money (mean predicted reporting 
rate M = 73%, SD = 29), lower when the wallet 
contained a modest amount of money (M = 65%, 
SD = 24), and lower still when the wallet con- 
tained a substantial amount of money (M = 55%, 
SD = 29). The average predicted change in re- 
porting rates from condition to condition was 
significantly different from the actual change in 
reporting rates (P < 0.001 for all pairwise compar- 
isons). As the amount of money increased, 64% 
of respondents incorrectly predicted that reporting 
rates would decrease and 18% correctly predicted 
that reporting rates would increase (P < 0.001 by 
a sign test). Additional questioning suggests that 
respondents’ predictions reflected a mental model 
of human behavior that exaggerates the role of 
narrow self-interest (20, 27). When wallets con- 
tained more money, respondents expected self- 
interest to grow and altruistic concerns for the 
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owner to fade, and they gave little weight to the 
influence of theft aversion on reporting rates (see 
table S13). 

The general public incorrectly predicts how 
citizens will respond as the monetary value of the 
wallet increases, but perhaps professional econ- 
omists will be more accurate. We asked a sample 
of 279 top-performing academic economists to 
make the same set of predictions. Like our non- 
experts, this sample also did not expect reporting 
rates to increase for wallets with larger amounts 
of money. As shown in Fig. 3C, respondents on 
average predicted that rates of civic honesty 
would be higher in the NoMoney and Money 
conditions (M = 69%, SD = 25 and M = 69%, SD = 
21, respectively) than in the BigMoney condition 
(M = 66%, SD = 23). These predictions were 
again significantly different from the actual 
changes we observe across conditions (P < 0.001 
for all pairwise comparisons). However, the de- 
gree of miscalibration among economists was 
less severe than in our nonexpert sample. As 
the amount of money increased, 49% of econo- 
mists incorrectly predicted that reporting rates 
would decrease and 29% correctly predicted that 
reporting rates would increase (P < 0.001 by a 
sign test). 

We conducted field experiments in 40 coun- 
tries to examine whether people act more dis- 
honestly when they have a greater economic 
incentive to do so, and we found the opposite to 
be true. Citizens were more likely to return wal- 
lets that contained relatively larger amounts of 
money. This finding is robust across countries 
and institutions and holds even when economic 
incentives for dishonesty are substantial. Our re- 
sults are consistent with theoretical models that 
incorporate altruism and self-image concerns, 


but they also suggest modification in that non- 
pecuniary motivations directly interact with the 
material benefits gained from dishonest behav- 
ior. When people stand to heavily profit from 
engaging in dishonest behavior, the desire to 
cheat increases but so do the psychological costs 
of viewing oneself as a thief—and sometimes the 
latter will dominate the former. 

Our findings also represent a distinctive data- 
set for examining cross-country differences in 
civic honesty. Honesty is a key component of 
social capital (22), and here we provide an ob- 
jective measure to supplement the large body of 
work that has traditionally examined social capi- 
tal using subjective survey measures (2, 23-25). 
Using average reporting rates across countries, 
we find substantial variation in rates of civic 
honesty, ranging from 14 to 76%. This variation 
largely persists even when controlling for a coun- 
try’s gross domestic product, suggesting that 
other factors besides a country’s wealth are also 
at play. In the supplementary materials, we pro- 
vide an analysis suggesting that economically 
favorable geographic conditions, inclusive polit- 
ical institutions, national education, and cultural 
values that emphasize moral norms extending 
beyond one’s in-group are also positively asso- 
ciated with rates of civic honesty. Future re- 
search is needed to identify how these and other 
factors may contribute to societal differences in 
honest behavior. 
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LIGHT METALS 


Large plasticity in magnesium 
mediated by pyramidal dislocations 


Bo-Yu Liu™, Fei Liu’*, Nan Yang’, Xiao-Bo Zhai”, Lei Zhang’, Yang Yang“, Bin Li*}, 
Ju Li’, Evan Ma®, Jian-Feng Nie”*+, Zhi-Wei Shan'+ 


Lightweight magnesium alloys are attractive as structural materials for improving energy 
efficiency in applications such as weight reduction of transportation vehicles. One 
major obstacle for widespread applications is the limited ductility of magnesium, which 
has been attributed to (c + a) dislocations failing to accommodate plastic strain. We 
demonstrate, using in situ transmission electron microscope mechanical testing, that 
(c +a) dislocations of various characters can accommodate considerable plasticity 
through gliding on pyramidal planes. We found that submicrometer-size magnesium 
samples exhibit high plasticity that is far greater than for their bulk counterparts. Small 
crystal size usually brings high stress, which in turn activates more (c + a) dislocations in 
magnesium to accommodate plasticity, leading to both high strength and good plasticity. 


agnesium is the lightest structural metal, 

with a density about 35% and 77% less 

than that of aluminum and steel, respec- 

tively (1). Magnesium alloys are actively 

being developed because of their poten- 
tial usefulness for improving energy efficiency 
across the automobile, aircraft, and aerospace 
industries, in which the weight savings translate 
to lower energy consumption. However, the gen- 
erally limited ductility of Mg at room tempera- 
ture makes the processing and forming of profiles 
and components difficult and costly. Consequen- 
tially, low ductility has become one major obsta- 
cle that hampers the widespread applications of 
Mg products. 

The ductility of Mg is intimately related to 
the fundamental behaviors of pyramidal (c + a) 
dislocations (fig. S1), which are the major con- 
tributor to c-axis strain (2, 3). High ductility of 
Mg should therefore be achievable by generat- 
ing more (c+ a) dislocations (4-7). However, 
(c + a) dislocations are thought to be intrinsi- 
cally unstable by readily transforming into ses- 
sile structures that cannot contribute to plastic 
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Fig. 1. In situ TEM compression test showing that dislocation slip is 
responsible for the plastic deformation of an Mg single-crystal pillar under 
c-axis compression. (A) Hexagonal unit cell showing the loading orientation. 
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Engineering strain (%) 


strain (8-10). In light of this generally accepted 
understanding, proposed alloy design strategies 
primarily stabilize the (c + a) dislocation and 
prevent the glissile-to-sessile transformation (11). 
The glissile-to-sessile transformation is not ob- 
served in some recent simulation studies (12-14), 
in which (c + a) dislocations glide on pyramidal 
planes, even though the actual slip plane is under 
debate (15, 16). Controversy surrounds the fun- 
damental behavior of (c + a) dislocations, such 
as their ability to accommodate plastic strain 
and their slip pathways. This creates difficul- 
ties in rationalizing the mechanical behavior and 
for alloy design. We exploited in situ transmis- 
sion electron microscope (TEM) mechanical 
testing (17, 18), three-dimensional (3D) image 
reconstruction, and atomistic simulations to 
resolve the prevailing uncertainties. Our results 
document large plastic strains mediated by abun- 
dant (c + a) dislocations gliding on both pyram- 
idal I {1011} and pyramidal II {1122} planes. 
We performed in situ TEM mechanical test- 
ing at room temperature on submicrometer-size 
pillars of Mg single crystals (table S1). The pillars 
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were fabricated by focused ion beam milling and 
tested inside TEM (fig. S2). We compressed the 
pillars along their c axis (Fig. 1 and movie S1), 
with the misalignment angle at less than 5° In 
this condition, the (c) or (a) dislocation slip and 
{1012} deformation twinning are all difficult to 
generate. We also conducted controlled experi- 
ments with the electron beam switched off and 
confirmed that the electron beam we used had 
no obvious effect on the mechanical behavior 
of the tested samples (fig. S3). We performed 
g- b analyses (where g is the diffraction vector 
and b is the Burgers vector) to determine the 
Burgers vector of dislocations (79). 

All the pillars we tested underwent uniform de- 
formation and exhibited fairly large dislocation- 
mediated plastic strains without failure (Fig. 1 
and fig. $4). Dislocations were generated succes- 
sively from the top region of the pillar, propagated 
gradually toward the bottom part of the pillar. 
With further deformation of the pillar, individual 
dislocations became difficult to image because 
their density was too high (Fig. 1C); therefore, we 
found it hard to analyze the Burgers vectors of 
these dislocations. To circumvent this difficulty, 
we used trapezoidal-shaped samples to generate 
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(B) Stress-strain curve. (C) Snapshots showing an increase in dislocation density 
during compression. The dark-field TEM observation is conducted under a two- 
beam condition. Electron beam direction ~[2110] (a axis). e, engineering strain. 
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a stress gradient from the sample top to root 
(Fig. 2). During compression, we retracted the 
flat punch once dislocations appeared at the 
root (movie $2). Although the top was severely 
deformed, and its image contrast was complex, 
dislocations in regions near the sample root were 
all clearly visible. The Burgers vectors of these 
dislocations have both (c) and (a) components, 
and hence they are (c+ a). Presumably, the 
generation and slip of the (c + a) dislocations ef- 
fectively accommodates the plastic strain. 

The (c + a) dislocations we observed in our 
study usually exhibited half-loop and zig-zag 
configurations (fig. S5A), similar to the (c + a) 
dislocations observed in bulk Mg (4, 20-22). We 
believe the existence of such configurations sug- 
gests that (c + a) dislocations have both edge 
and screw characters and are thus of mixed type. 
The half loop shown in Fig. 3A formed at the top- 
right corner of the pillar and expanded con- 
tinuously toward the lower-left corner during 
straining until it reached the pillar surface (fig. 
S6 and movie S3). This observation indicated 
to us that the (c+ a) mixed dislocations are 
glissile, contributing to the plastic deformation, 
implying that (c + a) edge and screw dislocations 
are also glissile. For some (c + a) dislocations, 
some of their segments lie parallel to the inter- 
section of pyramidal and basal planes and are 
also perpendicular to the Burgers vector. Hence, 
they are of edge type (see geometry analyses in 
fig. S5). Such edge segments are glissile (Fig. 3B 
and movie S4). Examination of other half loops 
and edge segments indicated that they are all 
glissile (fig. S7 and movie S5). Moreover, we ob- 
served reversible motion of (c + a) dislocations 
under cyclic loading (fig. S8 and movie S6). This 
indicates that the (c + a) dislocations retained 
their identity and mobility rather than becom- 
ing sessile. The mobility of the (c + a) disloca- 
tions was further supported by our atomistic 
simulation in which (c+ @) dislocations nu- 
cleated during c-axis compression and glided 
on the pyramidal planes (figs. S9 and S10 and 
movie S7). 

The straight segments lying parallel to the 
pyramidal-basal intersection were reported pre- 
viously (20). The presence of such long segments 
has been attributed to low mobility of (c + a) edge 
dislocations (20), formation of sessile dislocation 
locks along the pyramidal-basal intersection (23), 
and dissociation of (c + a) dislocations into par- 
tials and basal stacking fault (8). Here, we pro- 
pose that such rectilinear configuration can also 
result from the formation of dislocation dipole 
(Fig. 3C and movie S5). A straight dislocation 
dipole can form when a dislocation is pinned 
(marked by the yellow cross). The arrangement 
of the dipole and its two neighboring segments, 
land 2, gave rise to an € shape. Under the applied 
stress, segments 1 and 2 glided toward the left, 
accompanied by the elongation of the dipole. The 
geometry analysis indicated that this dipole was 
pure edge. During further straining, segments 1 
and 2 formed a junction, leaving debris behind. 
The dislocation dipole and debris were both 
sessile, which can serve as obstacles to other 
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Fig. 2. Diffraction contrast analyses of (c + a) dislocations in a deformed trapezoidal sample. 
(A) Schematic illustration of the testing configuration. The pillar root is four times larger than 
the top. (B and C) Dark-field TEM images recorded from the same region. Electron beam 
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Fig. 3. In situ TEM showing the motion of (c + a) dislocations in different samples. (A) Expansion 
of a half loop. (B) Motion of an edge segment. (C) Formation of a dislocation dipole and debris. 
Electron beam direction ~[2110]. Schematic drawings of the moving dislocations are shown right to 
the TEM images. Symbol x refers to the pyramidal plane, and x’ is an adjacent pyramidal plane 
parallel to x. The red dashed line refers to the previous location of the moving dislocation. 


dislocations. A likely mechanism for dipole 
formation by way of cross-slip is shown in fig. 
S11, similar to what has been proposed in other 
hexagonal structures such as zinc (24). 

The formation of such a dislocation dipole 
requires cross-slip between two different pyram- 
idal planes. Comparison of these two pyramidal 
planes is shown in fig. S12. Although the cross- 
slip of (c + a) dislocations between pyramidal I 
and II planes was studied in a computer simula- 
tion (25), cross-slip has not been unambiguously 
confirmed by experiments. Traditionally, the 
(c + a) slip plane is determined by slip-trace 
analyses, which is usually compounded by the 
lack of 3D information. We used a series of TEM 
images to construct 3D tomography to reveal the 
configuration and slip plane of (c + a) disloca- 


tion (fig. S13 and movies S8 and S9). Figure 4 
shows 3D tomography of three (c + a) disloca- 
tions generated in c-axis compression. All three 
dislocations exhibit a curvilinear shape, indi- 
cating that they are mixed dislocations. When 
the pyramidal II plane is edge on, the projection 
of dislocation 1 becomes straight and lies on the 
trace of the pyramidal II plane, indicating that 
its gliding plane is pyramidal II (Fig. 4B). Simi- 
larly, the slip plane of dislocation 3 is pyram- 
idal I (Fig. 4C). Dislocation 2 lies on two adjacent 
pyramidal II planes, indicating that cross-slip 
occurred. Further gliding of its segments on the 
two pyramidal planes would generate a disloca- 
tion dipole. 

Our submicrometer-size Mg single crystals ex- 
hibit both higher strength and plasticity than 


2 of 3 


RESEARCH | REPORT 


Fig. 4. 3D reconstruction revealing the pyramidal | and II gliding planes and cross-slip of 

(¢ +a) dislocations. The hexagonal unit cells indicate the viewing directions and the two pyramidal 
planes. (A) Viewing direction is [2110]. Three curvilinear dislocations are selected for 3D analyses. 
(B) Viewing direction is [1010]. In this orientation, the pyramidal II plane is edge on. Dislocation 1 
is projected as a straight line, and hence, its slip plane is pyramidal Il. For dislocation 2, segments 2.1 
and 2.3 lie in different but nearby pyramidal II planes, indicating cross-slip of this dislocation. 
(C) Viewing direction is [1210]. The pyramidal | plane is edge on. Dislocation 3 is projected as a 
straight line, and therefore, its slip plane is pyramidal |. 


their bulk counterpart (26), leading to a phenom- 
enon of “smaller is stronger and more ductile.” 
This phenomenon likely originates from the fol- 
lowing factors. Small crystals usually have few 
preexisting dislocations, and therefore, a large 
amount of stress is required to nucleate dislo- 
cations. Once nucleated, dislocations can readily 
escape to the surfaces before dislocation multi- 
plication, necessitating increasing stress levels to 
nucleate other dislocations or activate other dis- 
location sources to continue plasticity. Such size 
effects result in high stress in submicrometer- 
size Mg, which in turn activates abundant (c + a) 
dislocations to accommodate more plasticity. 
Another reason for the good plasticity is the rich 
surface sources for dislocations per unit volume, 
due to the large surface-to-volume ratio, which 
enables profuse dislocations to be generated 
successively from the crystal surface. Moreover, 
deformation twinning, which often occurs in 
bulk Mg under c-axis straining and introduces 
shear localization and stress concentration (27), 
is not seen in our pillars. Therefore, no potential 
twin-induced crack initiation sites exist in the 
pillars, which may also contribute to the improved 
plasticity. Furthermore, in a more general case, 
the stress concentration associated with a flaw 
in the tiny crystal is expected to be small, as 
the stress concentration factor is related to 
the flaw length over the flaw-tip radius. Our 
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small crystal dimension limits this flaw aspect 
ratio. This prevents premature failure and al- 
lows the small crystal to maximize its potential 
ductility. 

Our findings provide information on the mo- 
bility of pyramidal dislocations and its relation- 
ship with plasticity in pure Mg of small sizes, 
as well as insights into strategies for achieving 
long-sought plasticity in Mg that is traditionally 
difficult to form at room temperature. Simulta- 
neously promoting dislocations and suppressing 
deformation twinning can be an effective strat- 
egy in this regard. Our experimental strategy 
can be extended to understanding the behavior 
of other hexagonal metals by identifying which 
microstructures promote or degrade proper- 
ties such as strength and ductility. 
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The global tree restoration potential 


Jean-Francois Bastin’, Yelena Finegold”, Claude Garcia®*, Danilo Mollicone”, 
Marcelo Rezende”, Devin Routh’, Constantin M. Zohner’, Thomas W. Crowther" 


The restoration of trees remains among the most effective strategies for climate change 
mitigation. We mapped the global potential tree coverage to show that 4.4 billion hectares 
of canopy cover could exist under the current climate. Excluding existing trees and 
agricultural and urban areas, we found that there is room for an extra 0.9 billion hectares 
of canopy cover, which could store 205 gigatonnes of carbon in areas that would naturally 
support woodlands and forests. This highlights global tree restoration as our most effective 
climate change solution to date. However, climate change will alter this potential tree 
coverage. We estimate that if we cannot deviate from the current trajectory, the global 
potential canopy cover may shrink by ~223 million hectares by 2050, with the vast majority 
of losses occurring in the tropics. Our results highlight the opportunity of climate change 
mitigation through global tree restoration but also the urgent need for action. 


measurements (data file S1) (8) of tree cover 
across all protected regions of the world (fig. S1) 
(9, 10). Using global environmental layers (table 
S1) (1D, we examined how climate, edaphic, and 
topographic variables drive the variation in nat- 
ural tree cover across the globe. The focus on 
protected areas is intended to approximate nat- 
ural tree cover. Of course, these regions are not 
entirely free of human activity (JD, presenting 
slightly lower tree cover than expected in some 
regions or higher tree cover than expected in 
other regions because of low fire frequency, but 
these ecosystems represent areas with minimal 
human influence on the overall tree cover. We 
then used a random forest machine-learning ap- 
proach (12) to examine the dominant environ- 
mental drivers of tree cover and generated a 
predictive model (Fig. 1) that enables us to inter- 
polate potential tree cover across terrestrial eco- 
systems. The resulting map—Earth’s tree carrying 
capacity—defines the tree cover per pixel that 
could potentially exist under any set of environ- 


hotosynthetic carbon capture by trees is 

likely to be among our most effective strat- 

egies to limit the rise of CO, concentra- 

tions across the globe (7-3). Consequently, 

a number of international initiatives [such 

as the Bonn Challenge, the related AFR100, and 
the New York Declaration on Forests (4, 5)] have 
established ambitious targets to promote forest 
conservation, afforestation, and restoration at a 
global scale. The latest special report (7) by the 
Intergovernmental Panel on Climate Change 
(IPCC) suggests that an increase of 1 billion ha 
of forest will be necessary to limit global warm- 
ing to 1.5°C by 2050. However, it remains unclear 
whether these restoration goals are achievable 
because we do not know how much tree cover 
might be possible under current or future cli- 
mate conditions or where these trees could exist. 
Previous efforts to estimate global tree cover 
potential have scaled existing vegetation esti- 
mates to the biome or ecoregion levels to provide 
coarse approximations of global forest degra- 
dation (6, 7). However, quantitatively evaluating 


which environments could support trees requires A B 
that we build models using direct measurements 
of tree cover (independent of satellite-derived as 90 - 100 
models) from protected areas, where vegetation «80 - 90 
cover has been relatively unaffected by human 15 
activity. With enough observations that span | 5 70 - 80 
the entire range of environmental conditions, 3} 60 - 70 
from the lowest to the highest possible tree cover, . 50 50 - 60 
we can interpolate these “natural tree cover” es- 2 
timates across the globe to generate a predictive = ee 
understanding of the potential tree cover in the $ 30 - 40 
absence of human activity. ® 25 20-30 

To explore the determinants of potential tree 5 os 
cover, we used 78,774 direct photo-interpretation 
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mental conditions, with minimal human activity 
(Fig. 2A). This work is directly underpinned by 
our systematic dataset of direct tree cover mea- 
surements (entirely independent of climate and 
modeled remote sensing estimates) (73) across the 
globe (fig. S1) (0). 

Across the world’s protected areas (fig. S2), 
tree cover ranged between peaks of 0% in dry 
desert and 100% in dense equatorial forest, with 
fewer values falling between these two extremes 
(figs. S2 and S3). We paired these tree cover mea- 
surements with 10 global layers of soil and climate 
data (table S1) (12). Our resulting random forest 
model had high predictive power [coefficient of 
determination (R?) = 0.86; intercept = -2.05% 
tree cover; slope = 1.06] (Fig. 1); rigorous k-fold 
cross-validation (fig. S4A) (77) revealed that our 
model could explain ~71% of the variation in tree 
cover without bias (R? = 0.71; intercept = 0.34% 
tree cover; slope = 0.99) (fig. S3, B and C). Our 
k-fold cross-validation approach also allows us 
to generate a spatially explicit understanding 
of model uncertainty (figs. S5 and S6) (7). Across 
all pixels, the mean standard deviation around 
the modeled estimate is ~9% in tree cover (28% 
of the mean tree cover) (figs. S5 and S6) (11). As 
such, these models accurately reflected the dis- 
tribution of tree cover across the full range of 
protected areas. We then interpolated this ran- 
dom forest model across all terrestrial ecosystems 
using all 10 soil and climate variables to project 
potential tree cover across the globe under exist- 
ing environmental conditions. 

The resulting map reveals Earth’s tree carry- 
ing capacity at a spatial resolution of 30 arc sec 
(Fig. 2A). The model accurately predicts the pres- 
ence of forest in all existing forested land on the 
planet (fig. S7A) but also reveals the extent of tree 
cover that could naturally exist in regions beyond 
existing forested lands. The most recent Food and 
Agriculture Organization of the United Nations 
(FAO) definition of “forest” corresponds to a land 
of at least 0.5 ha covered by at least 10% tree 
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Fig. 1. Predicted vs. observed tree cover. (A and B) The predicted tree cover (x axes) compared 
with the observed tree cover (y axes). (A) Results as a density plot, with the 1:1 line in dotted 
black and the regression line in continuous black (intercept = —2% forest cover; slope = 1.06; 

R? = 0.86), which shows that the model is un-biased. (B) Results as boxplots, to illustrate the quality 
of the prediction in all tree cover classes. 
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cover and without agricultural activity or human 
settlements (74). Using this definition, our map 
reveals that about two-thirds of terrestrial land, 
8.7 billion ha, could support forest (table S2). 
That value is 3.2 billion ha more than the current 
forested area (fig. S7A) (11, 15). We estimate that 
14 billion ha of this potential forest land is lo- 
cated in croplands (>99%) and urban areas (<1%), 
as delineated by the European Space Agency’s 
global land cover model (fig. S7B and table S2) 
(16), and 1.5 billion ha with croplands as de- 
lineated by Fritz et al. (fig. S7C and table $2) (7). 
Therefore, ~1.7 billion to 1.8 billion ha of po- 
tential forest land (defined as >10% tree cover) 
exists in areas that were previously degraded, 
dominated by sparse vegetation, grasslands, and 
degraded bare soils. 

To avoid the pitfalls of categorical forest defi- 
nitions, we also evaluated the tree canopy cover 
in a truly continuous scale (fig. S8). We refer to 
“canopy cover” as the area of the land that is 
covered by tree crown vertically projected to the 
ground (for example, 50% of tree cover over 1 ha 
corresponds to 0.5 ha of canopy cover) (fig. S8). 
By accounting for all levels of tree cover (from 
0 to 100%), this approach balances the relative 
contribution of different forest types (such as 
woodlands, open forest, and dense forest) and of 
wooded lands outside forests (such as savannas) 
across the globe. 

In total, 4.4 billion ha of canopy cover can be 
supported on land under existing climate con- 
ditions (pixel uncertainty = 28%; global uncer- 
tainty <1%) (table S2) (72). This value is 1.6 billion 
ha more than the 2.8 billion ha existing on land 
today (10, 15). Of course, much of the land that 
could potentially support trees across the globe is 
currently used for human development and agri- 
culture, which are necessary for supporting an 
ever-growing human population. On the basis 
of both the European Space Agency’s global land 
cover model (J6) and on Fritz and colleagues 
cropland layer (17), we estimate that 0.9 billion 
hectares are found outside cropland and urban 
regions (Fig. 2, B and C, and table $2) (77) and 
may represent regions for potential restoration. 
More than 50% of the tree restoration potential 
can be found in only six countries (in million 
hectares: Russia, +151; United States, +103; Canada, 
+78.4; Australia, +58; Brazil, +49.7; and China, 
+40.2) (data file $2), stressing the important re- 
sponsibility of some of the world’s leading eco- 
nomies. By comparing our country-level results 
to the commitments of 48 countries in the Bonn 
Challenge (4), we can provide a scientific eval- 
uation of the country-level restoration targets. 
Approximately 10% of countries have committed 
to restoring an area of land that considerably ex- 
ceeds the total area that is available for restora- 
tion (data file S2). By contrast, over 43% of the 
countries have committed to restore an area that 
is less than 50% of the area available for resto- 
ration. These results reinforce the need for better 
country-level forest accounting, which is critical 
for developing effective management and resto- 
ration strategies. Of course, it remains unclear 
what proportion of this land is public or privately 
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Fig. 2. The current global tree restoration potential. (A) The global potential tree cover 
representing an area of 4.4 billion ha of canopy cover distributed across the world. (B and C) The 
global potential tree cover available for restoration. Shown is the global potential tree cover (A), from 
which we subtracted existing tree cover (15) and removed agricultural and urban areas according to 
(B) Globcover (16) and (C) Fritz et al. (17). This global tree restoration potential [(B) and (C)] 
represents an area of 0.9 billion ha of canopy cover (table S2). 


owned, and so we cannot identify how much 
land is truly available for restoration. However, 
at a global scale, our model suggests that the 
global forest restoration target proposed by the 
IPCC (J) of 1 billion ha (defined as >10% tree 


cover) is undoubtedly achievable under the cur- 
rent climate. By scaling these forest area calcu- 
lations by biome-level mean estimates of carbon 
storage (18, 19), we estimate that vegetation in 
the potential restoration areas could store an 
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Fig. 3. Risk assessment of future changes in potential tree cover. (A) IIlustration of expected losses in potential tree cover by 2050, under the 
“business as usual” climate change scenario (RCP 8.5), from the average of three Earth system models commonly used in ecology (cesmicam5, 
cesmlbgc, and mohchadgem2es). (B) Quantitative numbers of potential gain and loss are illustrated by bins of 5° along a latitudinal gradient. 


additional 205 gigatonnes of carbon (GtC) if they 
were restored to the status of existing forests 
(table S2). 

Our model accurately depicts the regions 
where tree growth is possible under existing 
environmental conditions. However, changing 
climate conditions may alter the area of land 
that could support forest growth over the rest 
of the century, a point that needs to be consid- 
ered when developing long-term restoration 
projects. We tested this possibility by rerunning 
our potential tree cover model under future cli- 
mate conditions, projected under three Earth 
System Models (J0) and two Representative Con- 
centration Pathways (RCP) scenarios (RCP 4.5 
and 8.5) (2). Under both scenarios, the global 
tree carrying capacity is lower than the present 
day potential because of reductions in the po- 
tential area of tropics. This is in stark contrast 
to most current model predictions, which ex- 
pect global tree cover to increase under climate 
change (20). Although warming is likely to in- 
crease tree cover in cold regions with low tree 
cover (for example, in northern boreal regions 
such as Siberia) or with existing open forests 
(such as in tropical drylands) (Fig. 3), our model 
highlights the high probability of consistent de- 
clines of tropical rainforests with high tree cover. 
Because the average tree cover in the expand- 
ing boreal region (30 to 40%) is lower than that 
in declining tropical regions (90 to 100%), our 
global evaluation suggests that the potential glob- 
al canopy cover will decrease under future cli- 
mate scenarios, even if there is a larger total forest 
area with >10% tree cover. Therefore, despite 
potential increases in canopy cover in boreal 
(~130 Mha), desertic (~30 Mha), montane 
(~30 Mha), and temperate (~30 Mha) regions, the 
potential loss of forest habitat in tropical regions 
(~450 Mha) leads to a global loss of 223 Mha 
of potential canopy cover by 2050, correspond- 
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ing to 46 GtC (Fig. 3B and table S3). Such risks 
of loss do not account for future changes in 
land use, such as pasture and cattle raising (7), 
which might also contribute to the urgency of 
the situation. 

These models of future changes in tree cover 
potential reveal insights into how the structure 
of vegetation might change over time. Of course, 
these models are characterized by high un- 
certainty because, unlike the present-day in- 
terpolations, we rely on extrapolation of our 
machine-learning models outside of the existing 
range of global climate conditions. These extrap- 
olations cannot be considered to be future pro- 
jections of potential forest extent because they do 
not incorporate any of the ecological, hydrolog- 
ical, and biogeochemical feedbacks that would 
be associated with changes in forest cover. For 
example, it is possible that elevated CO. concen- 
trations under future climate scenarios might 
enhance the growth of those existing trees, al- 
though recent evidence suggests that increased 
growth rate does not necessarily translate to in- 
crease of carbon storage (27). However, our ap- 
proach has a strong predictive power to describe 
the potential tree cover in the absence of humans 
under any given set of future climate scenarios. 

The global photointerpretation dataset offers 
the capacity to characterize the potential tree 
cover under any given set of environmental con- 
ditions. The resulting openly accessible map can 
serve as a benchmark map to assess restoration 
opportunities (such as tree planting and natural 
assisted regeneration) around the globe, with a 
tree cover of reference that respects the natu- 
ral ecosystem type (for example, from wooded 
savannah to dense forest). However, restoration 
initiatives must not lead to the loss of existing 
natural ecosystems, such as native grasslands, 
that can support huge amounts of natural bio- 
diversity and carbon. Using existing global land- 


cover layers (15-17), our maps reveal that there 
is likely to be space for at least an additional 
0.9 billion ha of canopy cover. If these restored 
woodlands and forests were allowed to mature 
to a similar state of existing ecosystems in pro- 
tected areas, they could store 205 GtC. Of course, 
the carbon capture associated with global res- 
toration could not be instantaneous because it 
would take several decades for forests to reach 
maturity. Nevertheless, under the assumption 
that most of this additional carbon was sourced 
from the atmosphere, reaching this maximum 
restoration potential would reduce a consid- 
erable proportion of the global anthropogenic 
carbon burden (~300 GtC) to date (7). This places 
ecosystem restoration as the most effective solu- 
tion at our disposal to mitigate climate change. 
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Direct mapping of curve-crossing 
dynamics in IBr by attosecond 
transient absorption spectroscopy 


Yuki Kobayashi’, Kristina F. Chang’, Tao Zeng”, 
Daniel M. Neumark”**, Stephen R. Leone”®*** 


The electronic character of photoexcited molecules can abruptly change at avoided 
crossings and conical intersections. Here, we report direct mapping of the coupled interplay 
between electrons and nuclei in a prototype molecule, iodine monobromide (IBr), by 
using attosecond transient absorption spectroscopy. A few-femtosecond visible pulse 
resonantly excites the (B°Io-), Y(O*), and Z(0*) states of IBr, and the photodissociation 
dynamics are tracked with an attosecond extreme-ultraviolet pulse that simultaneously 
probes the I-4d and Br-3d core-level absorption edges. Direct comparison with quantum 
mechanical simulations unambiguously identifies the absorption features associated with 
adiabatic and diabatic channels at the B/Y avoided crossing and concurrent two-photon 
dissociation processes that involve the Y/Z avoided crossing. The results show clear 
evidence for rapid switching of valence-electronic character at the avoided crossing. 


he conventional picture of a photoexcited 

molecule smoothly evolving toward a pro- 

duct state on a single potential surface is 

invalid when degeneracies between neighbor- 

ing states induce nonadiabatic interactions 
(1-3). Molecular dynamics that involve avoided 
crossings and conical intersections have been of 
fundamental interest in chemical physics since 
the seminal Landau-Zener model pioneered in 
the 1930s (4, 5). It is now widely accepted that 
nonadiabatic interactions play key roles in broad 
classes of photochemical reactions, such as photo- 
isomerization in the retinal chromophore (6) and 
photostability of DNA base pairs against ultra- 
violet radiation (7). Realization of laser-based con- 
trol of nonadiabatic processes represents a pivotal 
milestone in recent progress of molecular spec- 
troscopy (8, 9). 

Despite these successes, real-time observation 
of electronic dynamics in nonadiabatic regions 
remains elusive (10). Conceptually, electronic char- 
acter can rapidly change in synchrony with nu- 
clear motion. Experimentally, few-femtosecond 
time resolution is required for the probe, and the 
intrinsic degeneracy makes it challenging to ob- 
tain state-resolved information. Several exper- 
imental methods have been applied to this end 
(11-15), including high-harmonic spectroscopy 
(16) and ultrafast electron diffraction (17). Re- 
cent theoretical studies predicted that transient 
absorption spectroscopy in the x-ray/extreme- 
ultraviolet (XUV) range offers a distinct and pow- 
erful route to measure excited-state dynamics 
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around nonadiabatic regions, with core-level ab- 
sorption capturing the marked reorganization of 
valence electrons (J8, 19). Core-level absorption 
has superb state resolution (20); not only spin- 
orbit fine structure but even near-degenerate 
electronic states in nonadiabatic regions can be 
clearly resolved, as we demonstrate in this work. 
The probe step of core-to-valence transitions, 
being free of strong-field processes, can be di- 
rectly simulated with quantum-chemistry calcu- 
lations (27). Furthermore, when combined with 
high-harmonic generation-based attosecond light 
sources, this all-optical method has the potential to 
attain subfemtosecond probing time resolution (22). 

Here, we report attosecond transient absorp- 
tion mapping of valence-electron dynamics ac- 
companying visible-light excitation of iodine 
monobromide (IBr), a prototype molecule for 
nonadiabatic photodissociation dynamics (23, 24). 
Our primary focus is the switching of electronic 
character ensuing at an avoided crossing be- 
tween excited B (IT: ) and Y(0*) states. An outline 
of the experiment is depicted in Fig. 1A. Photo- 
dissociation is triggered by a few-femtosecond 
visible pump pulse [wavelength (A) = 530 nm, 
Ai = 70 nm, 8 fs, 15 uJ] (Fig. 1B), which selec- 
tively excites the visible absorption band of IBr 
(Fig. 1B) (25) and minimizes the unfavorable 
ionization effects that arise in more conventional 
experiments in which intense near-infrared exci- 
tation is used (26). In the probe step after a delay 
time t, an attosecond XUV pulse produced through 
high-harmonic generation is introduced (45 to 
72 eV, ~200 as) (Fig. 1C), which encodes the time 
evolution of the photoexcited molecule in the 
characteristic core-to-valence absorption signals. 
The XUV probe pulse simultaneously detects the 
two core-level absorption edges of IBr: the N4,5 
edge (4d orbitals) of iodine and the M,,; edge 
(3d orbitals) of bromine. This feature enables full 
tracking of both photofragment electronic states, 


providing considerably more insight than exper- 
iments in which only one fragment is tracked (15). 

The potential energy curves of IBr are shown 
in Fig. 1D. Multiple avoided crossings are present 
in diatomic interhalogens because of the strong 
spin-orbit couplings of the halogen atoms and 
the absence of g-w symmetry for heteronuclear 
diatomics (23, 24). Two physical pictures are in- 
voked to describe the dynamics at avoided cross- 
ings (27); in a diabatic picture, electronic states 
conserve their character as they move along the 
reaction coordinate, whereas in an adiabatic pic- 
ture, electronic states are eigenstates of the elec- 
tronic Hamiltonian and are subject to switching 
of electronic character. The visible pump pulse 
excites the molecule to an attractive B(?I : ) 
state, which undergoes an avoided crossing with 
arepulsive Y(0") state [distance of crossing (R,) = 
3.3 A]. In the diabatic channel at the B/Y avoided 
crossing (Fig. 1D, inset), the photoexcited mole- 
cule remains on the attractive B(?T19-) potential, 
conserving its electronic character, and evolves 
toward the spin-orbit excited I(°P3/2) + Br*?Pyy2) 
asymptote. In the adiabatic channel (Fig. 1D, 
inset), the photoexcited molecule transfers to the 
repulsive Y(0*) potential, rapidly switching its 
electronic character, and proceeds to the ground 
ICP3/2) + Br(’Ps/2) asymptote. The nonadiabatic 
coupling is of intermediate strength for the B/Y 
avoided crossing of IBr, and the measured disso- 
ciation ratio between the diabatic and adiabatic 
channels is approximately 3:1 (28). In the exper- 
iments, resonance-enhanced two-photon pro- 
cesses are also observed for the visible pump 
pulse, which reaches a peak-field intensity of ~5 x 
10” W/cm2?. An electronic-structure analysis 
reveals a large transition dipole moment (~3 D) 
associated with the B — Y transition (26). The 
Y(0*) state undergoes an avoided crossing with 
the higher Z(0*) state (R. = 2.8 A), and the disso- 
ciation across the Y/Z avoided crossing leads to 
the I*?Py2) + Br(?Ps/2) asymptote. The B > Z 
transition is also possible but of less importance 
(~20% compared with B — Y) and is not consid- 
ered further here. 

We first analyzed the transient absorption spec- 
trum of the dissociation products (Fig. 2A, re- 
corded at delay times from 215 to 245 fs). The 
differential optical density (AOD) is defined as 
a logarithmic ratio of the transmitted XUV spec- 
tra with and without the visible pump pulse. 
Iodine and bromine atoms each exhibit three 
absorption lines in the I 4d — 5p and Br 3d —> 4p 
series: 45.9, 46.7, and 47.6 eV for I and 64.5, 65.1, 
and 65.6 eV for Br, corresponding to the Psi — 
°Dsjo, Payg — 7D3/2, and *P3/2 — *Dsyo transi- 
tions, respectively (29, 30). In Fig. 2A, the product 
ratio between the Br and Br* atoms calculated 
from the XUV absorption amplitudes is Br*/Br = 
3.3, which is in reasonable agreement with a pre- 
vious measurement at 525-nm excitation (Br*/ 
Br = 3.0) (28). At the same time, a sizable signal 
from the I* atom is observed, yielding a product 
ratio of I*/I = 0.5. The appearance energy of the 
I* atom (~2.8 eV) is higher than the center pho- 
ton energy of the visible pump pulse (~2.3 eV); 
thus, this signal is attributed to two-photon (or 
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multiphoton) visible-excitation processes. A group 
of weak absorption signals at 55 to 58 eV cor- 
responds to the I 4d — np (n > 5) Rydberg 
series (31). 

Transient absorption spectra as a function of 
delay time are shown in Fig. 2B. The measure- 
ments are carried out from -16 to 245 fs delay 
time at 1.5-fs intervals. To calculate AOD, the 
pump-on/pump-off XUV spectra were each aver- 
aged over 200 frames (40 laser pulses per frame). 
In Fig. 2B, the excited-state absorption (AOD > 0) 
is shown in bright colors, and the ground-state 
bleach (AOD < 0) is shown in gray shades. The 
observed ground-state bleach features, both in 
the I-4d and Br-3d windows, exhibit signatures 
of vibrational coherences in the ground X(1Zo+) 
state [isotopic averages: vibrational frequency 
(We) = 268 m, anharmonicity (@,2) = 0.8 em", 
vibrational period (T) = 125 fs] (32). In previous 
studies, vibrational coherences in the neutral 
ground states launched by strong-field ionization 
were reported (33, 34). In this work, the exci- 
tation mechanism is a resonant single-photon 
process, and distinct nodal structures are ob- 
served in the oscillating absorption signals. We 
performed quantum wave-packet simulations 
to compute the core-level absorption spectra, 
and the observed features are reproduced by 
taking coherences in the v = 0, 1, and 2 vibra- 
tional states (fig. S3). The mechanism for the 
overtone (wv = 2) excitation is attributed to a 
stimulated Raman process enhanced by reso- 
nant visible-light coupling between the X(1Zo+) 
and B(3Ipo+) states. 

The details of the excited-state absorption in 
the I-4d and Br-3d windows are shown in Fig. 3, 
A and B, respectively. The early time signals (0 to 
50 fs) exhibit sweeping shifts to lower photon 
energy, which is indicative of molecular disso- 
ciation proceeding on the B(?IT9: ) potential. The 
evolution of the photoexcited molecule at the 
B/Y avoided crossing is encoded in the subse- 
quent temporal window (50 to 80 fs) (Fig. 3, A 
and B, dashed boxes), and the corresponding 
absorption traces taken at 6-fs intervals are 
shown in Fig. 3, C and D (50, 56, 62, 68, 74, and 
80 fs). The sharp absorption features associated 
with the I*(46.7 eV) and Br*(65.1 eV) atoms are 
nearly invariant with respect to delay time. These 
atomic fragments conceivably originate from two- 
photon processes that occur on a shorter time 
scale, an assignment that was corroborated with 
ab initio simulations. The other broad signals 
(indicated in Fig. 3 with arrows labeled “diabatic” 
and “adiabatic”) exhibit dramatic variations 
both in the absorption amplitude and energy. 
As already mentioned, the diabatic channel at 
the B/Y avoided crossing leads to the I + Br* 
asymptote (Fig. 1D), and the associated wave 
packet conserves its electronic character. In the 
experimental results of the Br-3d window (Fig. 3D), 
the absorption signal that exhibits continuous 
shifts to lower photon energy converges to the 
Br* absorption line and hence is assigned to the 
diabatic channel. In the adiabatic channel, how- 
ever, the photoexcited molecule changes the elec- 
tronic character and proceeds to the ground I + Br 
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Fig. 1. Outline of experiment. (A) Illustration of the experimental setup. (B) Spectral intensity 

of the visible pump pulse (green area) and visible-absorption band (extinction coefficient) of 

IBr (black dashed curve) measured previously in (25). (©) Spectral intensity of the XUV probe 
pulse recorded with (purple area) and without (blue area) the IBr sample and the optical density 
(black dashed curve) calculated from the two spectra. The |-4d and Br-3d edges are covered 

by the broadband XUV spectrum. (D) Adiabatic potential energy curves of IBr, with the visible 

and XUV excitation pathways indicated with vertical arrows. The red and blue wave packets indicate 
the dissociation pathways from one-photon and two-photon processes, respectively. Avoided 
crossings are formed between the B and Y states (R, = 3.3 A) and Z and Y states (R, = 2.8 A). 
(Inset) The adiabatic and diabatic channels at the B/Y avoided crossing. 
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asymptote (Fig. 1D). In the experimental re- 
sults (Fig. 3D), the Br absorption line suddenly 
emerges accompanied by no energy shift at a sepa- 
rate photon energy. The discontinuous evolution 
of the Br signal is a manifestation of the new 
electronic character acquired from the neighbor- 
ing Y(0*) state. In the I-4d window (Fig. 3C), the 
XUV absorption signals associated with the dia- 
batic and adiabatic channels are not as widely 
separated as in the Br-3d window because the 
same I atom is produced in both channels. None- 
theless, the continuous shifts to lower photon 
energy and the sudden emergence of the sharp 
I signal are distinctly resolved, leading to con- 
sistent assignments of the B/Y avoided-crossing 
signals. 

To gain more detailed insight into the transi- 
tory molecular features, we performed ab initio 
simulations of the core-level absorption spectra 
(26). The valence and core-excited electronic 
structures were computed by means of spin-orbit 
generalized multiconfigurational quasi-degenerate 
perturbation theory (SO-GMC-QDPT) (27, 35), and 
from these, the XUV absorption strengths were 
obtained. Nonadiabatic dissociation dynamics 
were simulated fully quantum mechanically by 
numerically solving the time-dependent Schrédinger 
equation. The initial wave packets were taken to 
be in the Franck-Condon region of the B(*Ipo: ) 
state (one-photon process) or on the Y(0*) state 
(two-photon process) to model the relevant visible- 
light excitations. 

Simulated results for the one-photon process 
are shown in Fig. 3, E and F. The sweeping en- 
ergy shifts in the early time window are suc- 
cessfully reproduced by the simulation. The 
B(?Io+ ) state in the Franck-Condon region has a 
[o7x*n**o*"] configuration and is probed by the 
core-level transitions to the x* and o* orbitals. 
The observed absorption lines reflect the multi- 
plet structures of the core-excited states that 
originate from the spin-orbit couplings in the 
core (I-4d, Br-3d) and valence (rn, n*) orbitals 
(fig. S6). Some features are broadened and less 
pronounced in the experiments compared with 
the simulations, which we attribute to the finite 
spectral width of the visible pump pulse and the 
nuclear wave-packet motion on the repulsive 
core-excited potentials. The absorption traces 
in the temporal window for the B/Y avoided 
crossing (50 to 80 fs) are shown in Fig. 3, G and 
H, with assignments of the diabatic and adiabatic 
channels directly made from the wave-packet 
simulations. The continuous (diabatic) and dis- 
continuous (adiabatic) evolution of the XUV ab- 
sorption signals exhibits an excellent match with 
the experiments, providing clear confirmation of 
the experimental signal assignments at the B/Y 
avoided crossing. 

The origin of the I* and part of the Br* sig- 
nals is verified with the two-photon simulations 
(Fig. 3, I and J). The simulated I* signal (Fig. 31) 
matches the experimental result (Fig. 3A) both in 
time and absorption energy. Passage through the 
Y/Z avoided crossing (Fig. 1D) is calculated to 
occur in ~25 fs. These early time dynamics are 
not as clearly resolved in the experiments be- 
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cause of the spectral broadening from the fast 
dissociative motion in the two-photon processes. 
Calculated absorption strengths of the Y(0*) and 
Z(O*) states are provided in fig. S7 for future 
reference. The Br* atom is produced by way of 


the wave packet adiabatically transferring from the 
repulsive Y(0*) potential to the attractive B(?T9: ) 
potential at the B/Y avoided crossing. As such, 
in the simulation, the switching of electronic 
character causes abrupt emergence of the Br* 
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Fig. 3. Experimental and simulated spectra of nonadiabatic dissociation dynamics. (A and B) 
Experimental absorption spectra in the (A) |-4d and (B) Br-3d windows, respectively. The temporal 
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Fig. 4. Orbital energies as a function of 
internuclear distance. The energy difference 
between the core and valence orbitals are 
closely tied to the observed absorption 
energies. Around the B/Y avoided crossing 
(gray area), the energies of the core orbitals 
and valence 2x/n* orbitals are nearly invariant, 
whereas those of the o/o* orbitals show 
increase or decrease versus internuclear 
distance. The displayed molecular orbitals 
are computed at R, = 3.3 A. 


signal (Fig. 3J), and the corresponding absorp- 
tion feature is resolved at ~40-fs delay time in 
the experimental results (Fig. 3B). 

To further investigate the electronic-structure 
information at the B/Y avoided crossing im- 
printed in the core-level absorption spectra, we 
examined the orbital energies and electron con- 
figurations. The energies of the active molecular 
orbitals are shown in Fig. 4 as a function of inter- 
nuclear distance computed at the level of mul- 
ticonfigurational self-consistent field (MCSCF). 
The energy differences between the core and 
valence orbitals are closely tied to the observed 
absorption energies. The exact description of the 
electronic states requires multiconfigurational 
treatments and inclusion of spin-orbit couplings, 
as is done in the SO-GMC-QDPT calculations 
(26). The energies of the I-4d and Br-3d orbitals 
are nearly invariant throughout the reaction co- 
ordinate. This insensitivity to internuclear dis- 
tance indicates that the observed energy shifts in 
the diabatic channel (or no shift in the adiabatic 
channel) reflect the energy variations in the va- 
lence orbitals alone. As for the valence orbitals, the 
energy gradients quantified at the B/Y avoided 
crossing (gray area) are +1.5 eV/A and -1.7 eV/A 
for o and o*, respectively, but only +0.3 eV/A and 
-0.2 eV/A for n and r*, respectively. These trends 
indicate that the x and n* orbitals are effectively 
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no longer contributing to the variation of the 
potential energies when the wave packet reaches 
the avoided crossing. A more direct clue to the 
electronic-character switching is obtained from 
the electronic configurations computed for each 
adiabatic state (36-38). Just before the avoided 
crossing, the B(°II:) state is a mixed configu- 
ration of [o°n*n*%o*"] and [o7n?x**o*"]; a single 
vacancy lies in o*, and the other vacancy is dis- 
tributed between x and n*. After the photoexcited 
molecule adiabatically transfers to the repulsive 
Y(0*) potential, the main configuration changes 
to [o°n°x*°o*7], where the o* orbital is fully oc- 
cupied, and the two vacancies lie in the x and 
n* orbitals. A physical picture emerges from 
these results: The vacancy in the valence orbitals 
switches from o* to n/nx* at the B/Y avoided 
crossing, giving rise to new core-level absorption 
signals with no further energy shifts, reflecting 
the energy invariance of the x and n* orbitals 
versus internuclear distance. 

The visible-absorption band of IBr is a key 
benchmark system for curve-crossing dynamics. 
We applied attosecond transient absorption spec- 
troscopy in combination with the SO-GMC-QDPT 
calculations of the electronic structures and re- 
vealed the switching of the valence-orbital vacan- 
cy from o* to n/n* at the B/Y avoided crossing. 
This intuitive picture was obtained by the power- 
ful ability of the attosecond XUV pulse to sensi- 
tively encode the energies and occupancies of 
valence orbitals in the core-to-valence absorp- 
tion spectra of IBr. The valence-electronic struc- 
ture not only determines the strength of chemical 
bonds, it also governs the reactivity of a molecule 
as represented by frontier-orbital theory; unrav- 
eling the dramatic evolution of valence-electronic 
structures around nonadiabatic regions will great- 
ly extend knowledge of photochemical reactions. 
Application of ultrafast x-ray/XUV light sources 
in the explorations of chemical dynamics is an 
actively advancing field of research (39, 40), and 
this work shows the great promise of attosecond 
transient absorption spectroscopy for mapping 
the evolution of valence electrons in larger mo- 
lecular systems. 
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The great Atlantic Sargassum belt 


Mengqiu Wang’, Chuanmin Hu”, Brian B. Barnes’, Gary Mitchum’, 


Brian Lapointe”, Joseph P. Montoya” 


Pelagic Sargassum is abundant in the Sargasso Sea, but a recurrent great Atlantic 
Sargassum belt (GASB) has been observed in satellite imagery since 2011, often 
extending from West Africa to the Gulf of Mexico. In June 2018, the 8850-kilometer 
GASB contained >20 million metric tons of Sargassum biomass. The spatial distribution 
of the GASB is mostly driven by ocean circulation. The bloom of 2011 might be a result 
of Amazon River discharge in previous years, but recent increases and interannual 
variability after 2011 appear to be driven by upwelling off West Africa during boreal 
winter and by Amazon River discharge during spring and summer, indicating a possible 
regime shift and raising the possibility that recurrent blooms in the tropical Atlantic 
and Caribbean Sea may become the new norm. 


he Sargasso Sea is named after the floating 

mats of Sargassum seaweed, first reported 

by Christopher Columbus in the 15th cen- 

tury. These seaweed attract fish, shrimp, 

crabs, birds, and turtles (7-3), providing 
essential habitats and serving as hotspots for 
biodiversity and productivity. Two species of 
Sargassum, S. fluitans and S. natans, are the 
most abundant in the Sargasso Sea and the Gulf 
of Mexico (1, 4), which are notably connected by 
ocean currents. 

Large quantities of Sargassum have recently 
been reported in the central Atlantic Ocean and 
the Caribbean Sea (5-14), accompanied by fre- 
quent beaching events that have caused serious 
environmental, ecological, and economic prob- 
lems (15, 16). Numerous workshops have been 
held to develop strategies to respond to Sargassum 
inundations (/7, 18). A critical question is whether 
a regime shift in the atmospheric and/or oceanic 
climatic conditions has led to the recent changes. 
Several hypotheses have been proposed concern- 
ing the relative roles of warming temperatures, 
climate change, and nutrient enrichment (19-23), 
but the lack of large-scale Sargassum data has 
prevented investigators from reaching a solid 
conclusion. 

We attempt to address this question using long- 
term satellite data, numerical models, and field 
measurements. The 19-year record of observations 
from the Moderate Resolution Imaging Spectro- 
radiometer (MODIS) satellite instrument reveals 
a recent, recurrent Sargassum belt extending 
across the Intra-Americas Sea and the tropical 
Atlantic (Fig. 1). The environmental and field 
data, along with numerical models, help us to 
understand the formation of this great Atlantic 
Sargassum belt (GASB), the connectivity be- 
tween different regions, as well as the mecha- 
nisms behind the tipping point in 2011 and 
interannual variations in subsequent years. 
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The entire monthly sequence of Sargassum 
abundance distributions (movie S1) shows that 
from 2000 to 2010, the central Atlantic showed 
very low abundance (5, 6, 12, 13), with occa- 
sional small quantities near the Amazon River 
mouth from August to November. The first mas- 
sive Sargassum bloom in the central Atlantic 
started in 2011 (6), and in later years the bloom 
developed to a GASB extending from West Africa 
to the Caribbean Sea and into the Gulf of Mexico 
(Fig. 1 and movie S1). 

The GASB formed in the summer months 
(Northern Hemisphere in this paper) of 2011-2018 
except in 2013 (Fig. 1C). In 2015 and 2018, the 
GASB showed the highest coverage, extending 
>8850 km and carrying a wet biomass of >9 mil- 
lion tons (>20 million tons in June 2018) (24). 
Once reaching the Gulf of Mexico, the belt fol- 
lowed the Loop Current and Gulf Stream to enter 
the North Atlantic Ocean. Some Sargassum were 
transported directly into the North Atlantic from 
the central west Atlantic following the Antilles 
Current (Fig. 1C). 

Although multiple sources of Sargassum may 
exist, the shape of the GASB is consistent with 
advection by the ocean circulation patterns in 
the tropical Atlantic. Through particle-tracking 
numerical experiments that account for both 
physical transport and biological growth, the 
July GASB patterns were well reproduced by for- 
ward tracking of simulated Sargassum particles 
for 6 months (fig. S1). This holds true even when 
a uniform particle distribution is used to initial- 
ize the model, although a more realistic initial- 
ization using MODIS observations led to improved 
model performance in reproducing the GASB pat- 
terns in July (25, 26) (fig. S1, A and B). Further- 
more, after accounting for biological factors under 
various scenarios, Sargassum density in the tropics 
could be captured even more accurately (fig. S1). 

In both July and January, most of the sim- 
ulated Sargassum particles in the central Atlantic 
are traced back to the same region, with a very 
weak connection to West Africa and almost no 
connection to the North Atlantic or Caribbean 
Sea (fig. S2) (25). This suggests that the major 
blooms in the central Atlantic likely developed 


locally rather than from seed populations in the 
Sargasso Sea. The weak connection between West 
Africa and the central Atlantic (fig. S2B) indicates 
that some Sargasswm may enter the central At- 
lantic from West Africa and bloom there. These 
observations match well with previous modeling 
work emphasizing the role of North Equatorial 
Recirculation Region (NERR) as a potential source 
region (19-21, 27) and other modeling efforts on 
the regional connections (28, 29). Field measure- 
ments of species compositions (30) also suggest 
that the Sargasso Sea is unlikely to be the main 
source of the blooms in the central Atlantic. 

It is natural to ask, then, what conditions in 
the central Atlantic triggered the first Sargassum 
bloom in 2011? Small amounts of Sargassum 
existed in the central Atlantic in previous years 
(movie S1), representing the seed populations. 
In 2009, higher-than-usual nutrients from the 
Amazon River discharge (31) (Fig. 2B and figs. 
S3A and S4C), as well as from upwelling in the 
eastern Atlantic (Fig. 2D, fig. S4D, and table $2), 
could stimulate Sargassum growth (32, 33), al- 
lowing massive blooms to occur. A related ques- 
tion is, why did a massive bloom not occur in 2010? 
We suggest that this was due to higher-than-usual 
sea surface temperatures (hereafter, SSTs) in 2010 
(Fig. 2D), which, according to laboratory experi- 
ments (34) (fig. S5) and our analyses of satellite- 
derived Sargassum change rates (26), would 
suppress Sargassum growth. In 2011, the SSTs 
were more suitable for Sargassum growth and 
the recycled nutrients from previous years and 
new nutrients from the current year would have 
created the correct conditions to initiate a massive 
bloom. Also, the low salinity induced by a large 
freshwater input from 2009 and 2010 would likely 
hinder Sargassum growth (34). A reasonable sce- 
nario of the 2011 bloom is therefore that nutrient 
accumulations from 2009 due to stronger upwell- 
ing in the eastern Atlantic and excessive Amazon 
River discharge in the western Atlantic provided the 
initial conditions, whereas high temperature and 
low salinity in 2010 delayed the bloom until 2011. 

After 2011, the Sargassum abundance in the 
central Atlantic showed similar seasonality as 
in the Gulf of Mexico, with increased abundance 
from January to June and decreased abundance 
from July to December (Fig. 3B and fig. S6). 
Considering the weak seasonality in insolation 
in the tropics, this seasonality might be the result 
of an innate biological clock (circannual rhythm), 
as exists in other brown seaweeds (35), combined 
with the seasonal nutrient supply (26). Such a 
mechanism is discussed further in the supple- 
mentary materials. 

In 2012, a Sargassum bloom first developed in 
spring and summer, but decreased rapidly from 
August to December. The earlier growth could 
have been a result of higher nutrient supply from 
upwelling processes and lower SSTs in the cen- 
tral east Atlantic from winter 2011 to spring 2012, 
whereas the rapid decrease after August could 
have been due to the overall lower nutrient sup- 
ply from the Amazon River during 2010-2011 
and the relatively higher SSTs after late summer. 
By January 2013, most Sargassum disappeared 
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Fig. 1. Sargassum distributions in the Gulf of Mexico and the Atlantic Ocean. (A) Monthly mean Sargassum areal coverage in the Caribbean Sea 
and the central Atlantic Ocean, with a maximum of ~ 6000 km? or >20 million tons in June 2018. The year mark starts from January. (B) Monthly mean 
Sargassum density (% cover) in January, April, July, and October of 2011-2017 after excluding the nonbloom year of 2013. (©) Monthly mean Sargassum 
density for the month of July from 2011 to 2018. The GASB is observed in all years except 2013. 


Wang et al., Science 365, 83-87 (2019) 5 July 2019 2 of 5 


RESEARCH | REPORT 


A 


2000 


Winter NAO index 
(previous year) 


Amazon 
River Mouth 


2005 201 


Sargassum area coverage 


61°W 50°W 38°W 


co 


, (104 m?/s) 
ie) 


n° 


Amazon River discharge anomaly 
sb 


a 


% cover 
- >0.02 


- 0.01 


°C 
- >1.0 


= 0.5 
- 0.0 


- -0.5 


61°W 50°W 38°W 
Amazon 
River Mouth 


Fig. 2. Environmental conditions and climate indices used to explain interannual changes of GASB. (A) Mean NAO index averaged from 
December to February (winter NAO) for 1990-2018. (B) Seasonal mean discharge anomaly of the Amazon River from 1990 to 2018 measured at the 
Obidos station. (C) Latitude-averaged (from 5°S to 23°N) Sargassum monthly areal coverage density from 2009 to 2018. (D) Latitude-averaged monthly 
mean SST anomaly from 2009 to 2018. In (C) and (D), the vertical lines mark the locations of 88°W, 61°W, 50°W, 38°W, and 15°W, representing the 
Yucatan peninsula coast, Barbados coast, the Amazon River mouth, the middle of the central Atlantic, and the West Africa coast, respectively. 


across the central Atlantic (movie S1). The re- 
duced seed populations from 2012, higher SSTs 
in the growth phase, and limited nutrients to- 
gether appeared to lead to a nonbloom year in 
2013 (table S2 and movie S1). 

In 2014, Sargassum grew rapidly during spring 
and summer. Unlike 201], the first Sargassum 
aggregation was identified in early January in 
the central east Atlantic. This rapid growth can 
likely be attributed to the nutrient enrichment 
from the West Africa upwelling from winter 2013 
to spring 2014 (table S2). The bloom continued to 
develop when reaching the central west Atlantic, 
where the Sargassum was nourished by high 
riverine nutrients accumulated in 2013-2014 
(table S2 and Fig. 2B). The favorable growth 
conditions (higher nutrients and lower SSTs) in 
the central east Atlantic and central west Atlantic 
would also have sustained the winter bloom, 
providing higher-than-usual seed populations to 
initiate the massive bloom in 2015 (table S2). 

In 2015, although the initial Sargassum growth 
rate was not as high as that in 2014, the seed 
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population was much larger than it was in pre- 
vious years. This, along with higher nutrients 
from the central east Atlantic in spring and early 
summer as well as from the Amazon River from 
2014 to 2015, led to a massive Sargassum bloom 
in 2015 (table S2). The higher concentrations of 
Sargassum required more nutrients to sustain, 
which would also explain the lower change 
rate in spring 2015 (Fig. 3, Aand B). From fall to 
winter, Sargassum decreased faster than in 
2014 under comparatively warmer waters and 
reduced nutrients (table S2). This significant 
decrease continued until February 2016. 

In 2016, only a small amount of Sargassum 
survived from 2015 and it was located in the 
central east Atlantic. The early growth was lim- 
ited by the lower nutrient supply from West 
African upwelling from winter 2015 to spring 
2016 (table S2). When the Sargassum was trans- 
ported eastward to the central east Atlantic after 
December 2016, however, it grew faster because 
of local nutrient enrichments (table S2), provid- 
ing seed populations for the 2017 bloom. 


In 2017, the bloom generally developed faster 
than in 2016 possibly as a result of higher nu- 
trient supplies from the Amazon River and the 
West Africa upwelling since winter 2016 (table 
$2). During winter 2017, the Sargassum change 
rate was much higher than it was in 2016, which 
would also have benefited from the lower SSTs 
and the higher availability of nutrients. The large 
amount of Sargassum that developed in the winter 
months helped to form the bloom in 2018. 

Overall, the recent bloom events show connec- 
tions to nutrient enrichment and climatic varia- 
tions. Higher wintertime North Atlantic Oscillation 
(NAO) values in the bloom years correlated well 
with lower SSTs and stronger upwelling (Fig. 2) 
(36, 37). Evidence for nutrient enrichment is also 
found in the Sargassum nutrient compositions. 
Specifically, the N:P ratios of the recent neritic 
samples show an increasingly P-limited growth 
compared with the historical baselines, which 
would be a result of long-term nutrient enrich- 
ment, especially N enrichment, in recent years 
(24). Other evidence to support recent nutrient 
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Fig. 3. Sargassum biomass and change rate from April 2011 to December 2018. (A) Monthly mean Sargassum biomass in the Caribbean Sea 

and central Atlantic. These estimates represent lower bounds because satellite measurements are insensitive to Sargassum accumulations in the vertical 
direction. The inset shows the correlation between the mean change rate in November and December (derived from the mean biomass change from 
October to December) with the annual mean Sargassum biomass in the next year. The red dot marks the data from 2019 (biomass averaged between 
January and April 2019) (B) Sargassum monthly change rate since 2011. The gray dashed line marks the climatological change rate between 2011 


and 2018 except for 2013. 


enrichment in the central west Atlantic comes 
from increased deforestation and fertilizer use in 
Brazil and increased water-column nitrogen from 
2010 to 2018 (fig. S4). 

The interannual changes in Sargassum blooms 
could be accounted for by changes in seed pop- 
ulations and oceanographic conditions, but a 
critical question remains: Can we predict future 
blooms based on these hindcast analyses? The 
following conditions appear to be associated with 
massive Sargassum blooms at magnitudes com- 
parable to those in 2015 and 2018: (i) large seed 
populations during winter as a result of the 
previous year’s bloom; (ii) higher nutrient supply 
from the West Africa upwelling in winter months, 
which can be inferred from higher chlorophyll 
levels and lower SSTs in satellite imagery; and (iii) 
higher nutrient supply from the Amazon River 
input but normal or lower SSTs during the cur- 
rent year. If these conditions are met, then a 
massive bloom is likely to occur in the central 
Atlantic, followed by severe beaching events in 
the Caribbean Sea in later months. Furthermore, 
during November to December, the Sargassum 
change rates showed negative correlations with 
SSTs (fig. S7, A and B), suggesting that the former 
might serve as an indicator for possible blooms 
in the following year (Fig. 3A, inset), with a lead 
time of at least 3 to 4 months. 

Finally, we recognize that there are active 
discussions within the research community on 
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the mechanisms driving the recent trends of 
Sargassum blooms. The explanation presented 
here is based on the physical connectivity across 
several regions, on the analysis of several envi- 
ronmental factors, on limited field studies, and 
on the satellite-based Sargassum observations. 
These modeling and observationally based analy- 
ses, although reasonable to the best of our knowl- 
edge, still require validation in the future and 
admittedly may not rule out other explanations. 
Conversely, the recurrent GASB clearly shows a 
regime shift after 2011 in bloom patterns and 
possibly in oceanographic conditions as well. A 
critical question is whether we have reached 
the point where recurrent GASB and beaching 
events may become the new norm. Under con- 
tinued nutrient enrichment due to deforesta- 
tion and fertilizer use in agriculture (fig. $4), 
along with the substantial mass of Sargassum 
seed populations lingering in the tropics (movie 
SD, the answer is likely positive, and more re- 
cent satellite observations between January 
and April 2019 also support this interpretation. 
However, the considerable Sargassum accu- 
mulations along the pathway of the GASB 
underline the need for multidisciplinary re- 
search to better understand their ecological 
and biogeochemical impacts (24, 38), as well 
as their impacts on coastal environments, tour- 
ism, economies, and human health (39), espe- 
cially if the role of Sargassum changes from 


that of an essential habitat to that of a severe 
and perpetual nuisance. 
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LIFE SCIENCE TECHNOLOGIES 


new products: general lab equipment 


Field-Flow Fractionation System 
The EAF2000 from Postnova 
Analytics combines the principles 
of electrical and asymmetrical 
field-flow fractionation (FFF) in a 
single platform. In this system, 
electrical and cross-flow fields are 
applied simultaneously across 
the FFF channel, enabling separation by size and simultaneous 
measurement of particle charge based on electrophoretic 
mobility. Combining these two powerful separation techniques 
in one platform opens the door to characterizing complex 
samples that have proven resistant using other techniques. 
EAF2000 can be used to separate and characterize complex 
proteins, antibodies, and viruses as well as charged 
nanoparticles, colloids, and polymers. 

Postnova Analytics 

For info: +49-(0)-8191-985-6880 

www.postnova.com 


Horizontal and Vertical Gel Electrophoresis Equipment 
Syngene announces its range of robust horizontal and vertical gel 
electrophoresis equipment. Featuring sturdy injection-molded 
construction and corrosion-resistant platinum electrodes, all 
tanks offer a robust, leak-proof environment, guaranteeing 

user Safety and long product life. With a choice of four sizes, the 
horizontal gel units (SynMini, SynMidi, SynChoice, and SynMaxi) 
allow scientists to run gels as small as 7 cm x 7 cm up to 20 cm 

x 20 cm. Each tank comes with two or three (for the SynChoice) 
sizes of cleverly designed, indentation-free, UV-transparent 

gel trays and a choice of combs in four different thicknesses. 
Syngene’s SynPAGE and SynWave vertical electrophoresis tanks 
come in two sizes and can be used with 10 cm x 10 cm or 20 cm x 
20 cm gels to run up to four handcast (additional plates required) 
or two commercial precast gels from any major supplier. For 
cost-effective versatility, scientists wanting to transfer their 
proteins to Western blots can also fit the new Syngene tanks with 
a modular blotting insert. 

Syngene 

For info: 800-686-4407 
www.syngene.com/product-category/electrophoresis 


DNA Sequencing Platform 

Pacific Biosciences launches it latest DNA sequencing platform— 
the Sequel Il System—delivering highly accurate long reads. 

The system includes the new SMRT Cell 8M as well as chemistry, 
instrument control software, and the SMRT Link software 
package. Using Sequel Il, customers can comprehensively detect 
human variants ranging in size from single-nucleotide changes 
to large, complex structural variants. The system is also ideal 
for standard applications such as de novo assembly of large 
genomes and whole-transcriptome analysis using the Iso-Seq 
method. High-fidelity reads can enable comprehensive human 
variant detection with >95% precision and recall of structural 
variants, >99% precision and recall of single-nucleotide variants, 


Produced by the Science/AAAS Custom Publishing Office 


and =96% precision and recall of insertions and deletions with as 
little as 15-fold coverage. 

Pacific Biosciences 

For info: 650-521-8000 

www.pacb.com/sequel 


Next-Generation Objectives 

Olympus introduces its next-generation X Line objectives. 
Ultrathin lenses manufactured by a revolutionary polishing 
technique enable the objectives to simultaneously boost 
numerical aperture, flatness, and chromatic correction— 
providing exceptional image quality for researchers and clinicians 
alike. Olympus has harnessed this proprietary technology 

to deliver its high-performance X Line objectives for clinical 

and research applications as well as UPLAPO-HR objectives, 

the world’s first plan-corrected apochromat objectives 

with a numerical aperture of 1.5 for total internal reflection 
fluorescence (TIRF) microscopy and superresolution applications. 
The new objectives will deliver significant improvements to a 
plethora of research and clinical applications without the need to 
change or upgrade imaging systems. 

Olympus 

For info: 704-877-8801 

www.olympus-lifescience.com 


Mass Spectrometry Systems 

Waters offers two new high-resolution mass spectrometry 
systems. The SELECT SERIES Cyclic IMS seamlessly integrates 
cyclic ion mobility (cIM) technology into a high-performance 
research-grade time-of-flight mass spectrometer enabling 
limitless experimental potential. By providing high levels of 
flexibility through inlets and acquisition modes, the SYNAPT 
XS delivers greater freedom of analytical choice, supporting 
scientific creativity and technical success for any application. 
Waters Corporation 

For info: 800-252-4752 

www.waters.com 


Triple Quad LC/MS-MS System 

The SCIEX Triple Quad 5500+ LC-MS/MS System - QTRAP Ready 
delivers superior quantitative results in a single injection 
workflow. When you activate the optional QTRAP functionality, 
your instrument acts as a linear ion trap. This allows you to 
acquire even more data from your sample without sacrificing the 
sensitivity or quality of your multiple reaction monitoring (MRM) 
analysis. The QTRAP technology incorporates additional scan 
types, such as MRM3 (MS/MS/MS), enhanced product ion scans, 
and many more. The additional scans can run at the same time 
you are acquiring your MRM, without any impact on performance 
or data quality. Whether you are focusing on pharmaceutical 
drug discovery workflows, clinical research, food and beverage 
testing, complex environmental analysis, or academic studies, or 
searching for answers from crucial forensic samples, the 5500+ 
system is equipped to handle the challenge. 

SCIEX 

For info: 877-740-2129 

sciex.com 


Electronically submit your new product description or product literature information! Go to www.sciencemag.org/about/new-products-section for more information. 


Newly offered instrumentation, apparatus, and laboratory materials of interest to researchers in all disciplines in academic, industrial, and governmental organizations are featured in this 
space. Emphasis is given to purpose, chief characteristics, and availability of products and materials. Endorsement by Science or AAAS of any products or materials mentioned is not 


implied. Additional information may be obtained from the manufacturer or supplier. 
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University of 
Zurich” 


Faculty of Science 


The Faculty of Science at the University of Zurich invites 
applications for an 


Assistant Professor 
Tenure Track in Digital 
Bioinorganic Chemistry 


We are seeking candidates with an outstanding track record 
to build an internationally recognized research program in 
digital bioinorganic chemistry. While all areas of biological 
inorganic chemistry will be considered, upon equal qualifi- 
cations, preference will be given to applicants with an interest 
in research directions complementing those of existing 
groups. This professorship lies within the UZH Digital 
Society Initiative and hence its research must have a strong 
link to aspects of digitalization, e.g. Big Data, bioinformatics, 
simulations, spectroscopy, high throughput, machine learning, 
or data mining. The selected candidate is expected to lead an 
independent research group linking experimental and 
computational methods and to participate in teaching at all 
levels of the BSc, MSc and PhD programs. The University of 
Zurich has an established program for career progression 
allowing promotion to associate and full professor upon 
successful evaluation. 


The University of Zurich provides generous research support, 
including dedicated funds for personnel, running expenses 
and competitive start-up packages. Moreover, the successful 
applicant is expected to acquire external research funding. 
Zurich's scientific environment encompasses a rich spectrum 
of diverse activities spanning physical to biomedical research 
that provides extensive opportunities for collaboration with 
research groups at the University of Zurich and other leading 
Swiss research institutions. 


The employment conditions for this position follow the legal 
regulations of the University of Zurich 

(please see www.prof.uzh.ch/de.html), including part-time 
options. The University of Zurich is an equal opportunities 
employer and in particular strives to increase the percentage 
of women in leading positions (please see 

https: / / www.mnf.uzh.ch/en/mnf-gleichstellung.html). 
Therefore, qualified female researchers are particularly en- 
couraged to apply. The city of Zurich combines a stimulating 
cultural scene in a modern European city with easy access to 
beautiful natural surroundings. 


Academics with the appropriate qualifications are kindly 
invited to submit their applications including: 

— acurriculum vitae 

— lists of publications and research funding, 

— a detailed research plan and a teaching outline 

— names and contact details of three referees 


Please upload your application to 
http:/ /www.mnf.uzh.ch/dbc by July 31, 2019. 


For further information, please contact Prof. Dr. Roger Alberto 
(email: ariel@chem.uzh.ch). 
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INTEGRATING SCIENCE. ENGINEERING, AND MEDICINE | 


ASSOCIATE or SENIOR EDITOR for SCIENCE 
TRANSLATIONAL MEDICINE 


The American Association for the Advancement of Science (AAAS) and 
Science Translational Medicine are seeking a talented scientist, physician 
scientist or clinician interested in biomedical/translational research to 
join the Science Translational Medicine editorial team in Washington 
DC, USA or Cambridge, UK. 


This editorial position is a fulltime position designed for a PhD scientist, 
MD/PhD physician scientist or MD clinician with research experience. 
We seek applicants who have a broad interest in translational research, 
the ability to rigorously and critically evaluate scientific and/or clinical 
research papers, as well as excellent communication and networking 
skills. Outstanding candidates in any biomedical field will be considered; 
expertise in one or more of the following fields would be an advantage: 
metabolism, diabetes/obesity, liver disease, gastroenterology, cell and 
gene therapy, respiratory disorders, clinical trials. 


The tasks include but are not limited to: 


Manage the evaluation, review, and editing of submitted research 
manuscripts in a broad range of biomedical fields 

Judge the scientific value of research and select reviewers for submitted 
manuscripts 

Discuss and make recommendations regarding manuscripts and 
reviewer comments with the editorial team, members of the Board 
of Reviewing Editors, and the authors 

Write short summaries of research papers for publication 

Guide authors on manuscript revisions and edit the manuscripts for 
scientific content and style 

Follow manuscripts through the production process to ensure material 
is published in a timely manner 

Commission and edit Review, Perspective and Focus articles on a 
broad range of topics 

Commission and edit review and opinion content for special issues 
Foster relationships and communication with the scientific community 
through meetings, lab visits and professional contacts 

Represent Science Translational Medicine at scientific meetings 
nationally and internationally. 


The minimum qualifications to be competitive and considered for 
the position are: 


Mastery of one or more biomedical fields typically acquired through 
completion of a doctoral or medical degree in at least one biomedical 
or clinical research field 

2 to 5 years post PhD or MD experience, including postdoctoral 
research experience in one or more translational fields and multiple 
publications 

Ability to work constructively as a member of a tight-knit dynamic 
team 

Comprehensive knowledge of scientific research methods in order to 
discuss technical issues with authors 

Excellent outreach skills in order to effectively recruit top research 
papers to the journal 

Exceptional written, communication, and listening skills in order to 
communicate with authors and reviewers in evaluating, editing and 
modifying manuscripts 

Prior scientific editorial experience is not essential but is preferred 


Application Process 

Submit a resume and cover letter outlining qualifications and interest in 
the position by August 6, 2019. Applications are reviewed on a rolling 
basis. We will contact you with next steps if you are selected for an 
interview. 


Application link: http://bit.ly/2WvSsv6 


AAAS is the world’s largest general scientific membership organization. 
Please visit the AAAS careers website http://www.aaas.org/page/ 
employment-aaas to get more information, and to apply online. 


AAAS is an Equal Opportunities Employer. All qualified applicants 
will receive consideration for employment without regard to race, 
color, religion, sexual orientation, gender, gender identity, national 
origin, age, disability, veteran status, or other protected category. 
AAAS uses E-Verify to confirm the employment eligibility of 
all newly hired employees. 
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G UNIVERSITY OF 
Wel CENTRAL FLORIDA 


Professor and Dean, College of Optics and Photonics 


The University of Central Florida (UCF) invites applications and nominations for the position of dean of the College of Optics and Photonics. The 
College of Optics and Photonics, which houses CREOL (Center for Research and Education in Optics and Lasers), is one of the world’s foremost 
institutions for research and education in optics and photonics. 


Located in Orlando, UCF is one of the nation’s most dynamic metropolitan research universities. We are seeking a visionary leader for the college. 
The dean of the college is responsible for working with the faculty to foster excellence in research and education while continuing to reinforce 
the role of the college within the university and industry. As the academic and administrative leader of the college, the dean’s responsibilities will 
include, but are not limited to: fiscal management, strategic planning, and academic programs. 


The next dean of the College of Optics and Photonics shall possess: 
a strong commitment to advance research, teaching, and learning; 
an ability and willingness to advocate for student success and faculty excellence; 
evidence of advancing diversity and inclusivity; 
the ability to inspire the college’s faculty, students and staff, university community, industry partners, donors, and alumni to dedicate their 
time and talent in pursuit of a shared vision for the college; 
a strong understanding of current trends and research directions in optics and photonics essential for promoting and accelerating research 
and discovery in the college; 
a working knowledge of the industry, its main challenges and strategic directions for the future. 


The position reports to the provost and vice president for Academic A ffairs. 


The College of Optics and Photonics offers M.S. and Ph.D. degrees in Optics and Photonics and a B.S. degree in Photonic Science and Engineering. 
The faculty engage in a broad spectrum of research covering all areas of optics and photonics, ranging from fundamental physics to device and 
systems research. 


The college has 35 core faculty members, and over 20 faculty members from other UCF colleges with joint appointments, about 50 research 
scientists and postdoctoral scholars, over 140 graduate students, and 150 undergraduate students. The college is home to the Center for Research 
and Education in Optics and Lasers (CREOL), the Florida Photonics Center of Excellence (FPCE), the Townes Laser Institute (TLI), the Townes 
Institute for Science and Technology Experimentation Facility (TISTEF), and the Institute for the Frontier of Attosecond Science and Technology 
(iFAST). Additional information is available at https://www.creol.ucf.edu. 


UCF and its 13 colleges provide opportunities to more than 68,000 students, offering 189 bachelor’s and master’s degrees, and 32 doctoral programs. 
Students come from all 50 states and over 147 countries. A priority on diversity and inclusiveness led UCF to co-found the University Innovation 
Alliance, a consortium of large public research universities committed to better serving first-generation, low-income students. U.S. News & World 
Report ranked UCF the 10th “Most Innovative School” in its Best Colleges 2019 report. Additionally, UCF is one of 56 public universities with 
the Carnegie Foundation’s highest designation in two categories: community engagement and very high research activity. As one of the largest 
universities in the country, UCF is committed to innovative community partnerships, world-class research with local and global impact and the 
integration of technology and learning. For additional information about the University of Central Florida, please visit www.ucf.edu. 


Minimum Qualifications 
Applicants must hold an earned doctorate from an accredited institution in a related field; have a strong research record and commitment to teaching; 
have a strong understanding of academic affairs; and have credentials to be appointed at the level of full professor with tenure at the college. 


Preferred Qualifications 
Preference will be given to applicants with demonstrated leadership experience in higher education, industry, or government lab, and a strong 
recognition in the field. 


Additional Application Materials Required 

UCF requires all applications and supporting documents to be submitted electronically through the Human Resources website, https://jobs.ucf. 
edu/en-us/job/497427/professor-and-dean-college-of-optics-and-photonics. In addition to completing the online application, candidates must 
upload a letter of interest, a complete CV, and contact information for three (3) professional references. 


References will be contacted only for short-listed candidates after notification. 
The position will remain open until filled. Screening of applications will begin August 15, 2019. 


Please contact the search committee chair, Dr. Michael Georgiopoulos, at michaelg@ucf.edu with questions, nominations and expressions of 
interest about the position. 


Equal Employment Opportunity Employer 

As an Equal Opportunity/Affirmative Action Employer, UCF encourages all qualified applicants to apply, including women, veterans, individuals 
with disabilities, and members of traditionally underrepresented populations. UCF ‘s Equal Opportunity Statement can be viewed at: 
http://eeo.ucf.edu/documents/PresidentsStatement.pdf. As a Florida public university, UCF makes all application materials and selection procedures 
available to the public upon request. 


Tenured Professor 
in Developmental Biology 


Department of Molecular and Cellular Biology 
Faculty of Arts & Sciences, Harvard University 


Department of Molecular 
and Cellular Biology 


The Department of Molecular and Cellular Biology at Harvard University seeks to appoint a tenured 


professor in developmental biology, with an emphasis on studying fundamental biological questions in 
either conventional or unconventional model see We are particularly excited about candidates who employ 

imited to, imaging genomics, and computation. The appointment is expected 
to begin on July 1, 2020 or as soon as feasible. The professor will teach and advise at the undergraduate and graduate levels. The Department offers an 
interactive research community with broad interests, state of the art animal facilities and core facilities for imaging, proteomics, genomics and bioinformatics, 
and close association with wider Harvard science initiatives such as the Quantitative Biology Initiative, the Harvard Stem Cell Institute, the Center for Brain 


modern quantitative methods such as, but not 


Science, the Broad Institute, and the Center for Nanoscale Systems. 


A doctorate ee is required. Applications should include: curriculum vitae, a research vision statement describing future plans, a teaching statement 
ing approach and philosophy, and a statement describing efforts to encourage diversity, inclusion, and belonging, including past, current, 


describing teac 
and anticipated future contributions in these areas. 


candidates beginning in the first weeks of October. 


underrepresented groups. 


Candidates are encouraged to apply by September 1, 2019. Applications will be reviewed until the position is filled. We anticipate interviewing short-listed 


We are deeply committed to improving the diversity of Harvard's faculty and scientific community. We strongly encourage applications from women and other 


Submit applications to: 


For information contact Polina Kehayova | kehayova@fas.harvard.edu | www.mcb.harvard.edu 


== HARVARD 


~ UNIVERSITY 


We are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, 
color, religion, sex, national origin, disability status, protected veteran status, gender identity, sexual orientation, pregnancy and 
pregnancy-related conditions or any other characteristic protected by law. 


CAREER OPPORTUNITY 
Faculty Position, , 

: St. Jude Children’s 
Cell and Molecular Biology _ Research Hospital 


Finding cures. Saving children. 


Memphis, TN 
The Department of Cell and Molecular Biology seeks applicants 
for faculty positions at all levels (Assistant, Associate or Full). 
St. Jude Children’s Research Hospital offers a remarkable 
opportunity to perform cutting-edge research with outstanding, 


long-term institutional support and exceptional core facilities in an 
environment of open collaboration. 


Successful applicants will possess a Ph.D., M.D., or both; and 
present a vibrant research program with potential for interaction 
both within the department and across the institution. 


Interested applicants should submit © 


a CV and a 2-page letter of research “ 
accomplishments and future interests. \ ¥ 


Applicants for Assistant Member 

positions should have three letters of 
recommendation. Applicants for Associate/ 
Full Member positions should send contact 
information for three references. 


J. Paul Taylor, M.D., Ph.D. 
Chair, Department of Cell and Molecular 
St. Jude Children’s Research Hospital 
262 Danny Thomas Place 

Memphis, TN 38105 
marquetta.nebo@stjude.org 

Apply online: bit.ly/facultycellbiology 


St. Jude Children’s Research Hospital is an 
equal opportunity employer. 


Tulane | 
University 


SCHOOL OF MEDICINE 
Assistant Professor-Tenure Track 


The Department of Pharmacology at Tulane University School of Medicine 
is currently recruiting faculty members with a Ph.D. or equivalent degree for 
tenure-track position at the level of Assistant or Associate Professor. Areas of 
preferred expertise include Vascular Biology, Mitochondrial Biology, and 
Neuroscience. 


This position will be supported by significant laboratory space, competitive 
salaries and excellent start-up packages. The requirements for Assistant 
Professor include a minimum of two years of postdoctoral research experience 
and for Associate Professor a minimum of three years of experience at the level 
of Assistant Professor and a NIH RO1 or other significant extramural funding. 


The successful candidate is expected to develop and maintain a research 
program related to cardiovascular disease that is complimentary to the 
department’s strengths, using pharmacology principles, theories, concepts, 
methods and techniques in order to design studies, manage projects and carry 
out investigations. Additionally, the candidate must maintain scholarly activity. 
Teaching duties will include delivery of lectures to graduate and medical 
students as well as participation in active learning sessions of graduate and 
medical students. The candidate will be required to provide service to the 
department and institution by serving on committees. Additionally, specific 
duties and distribution of effort in teaching, research, and service duties will 
be determined by the Department Chair and subject to change based on the 
needs of the department. 


Tulane University is an Affirmative Action and Equal Opportunity Employer 
and is committed to providing equal employment opportunity to qualified 
persons without regard to race, sex, religion, national origin, age, disability, 
gender identity, veteran s status, or any other status or classification protected 
by federal, state or local law. 


To submit an application, please apply online through the Tulane University 
employment portal. Use the following link: http://apply.interfolio.com/62633, 
or copy/paste into your browser; then click “Apply for this Job” to fill out an 
application and submit your documents. Please do not send documents via email. 
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myIDP: 
A career plan customized 
for you, by you. 


Science 


Features in myIDP include: 
= Exercises to help you examine your skills, interests, and values. 


= Alist of 20 scientific career paths with a prediction of which ones best fit your 
skills and interests. 


= A tool for setting strategic goals for the coming year, with optional reminders to 
keep you on track. 


= Articles and resources to guide you through the process. 
= Options to save materials online and print them for further review and discussion. 


= Ability to select which portion of your IDP you wish to share with advisors, 
mentors, or others. 


=" Acertificate of completion for users that finish myIDP. 


Visit the website and start planning today! 
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Careers in partnership with: 


RAYAAAS 


GH FASEB Yorn, UGE WELCOME 
Massachusetts : 


UMASS. Medical School University of California 
San Francisco 


Re ara leratic aed 


GOVERNMENT OF INDIA 
Ministry of Human Resource Development 
(Department of Higher Education) 
Technical Section - I 


Appointment of Director, IIT Kharagpur 


Applications are invited for appointment to the post of Director of Indian Institute of Technology (IIT) at Knaragpur. 
The Director of an IIT is the academic and administrative head of the Institution. He/she is expected to have a 
minimum of 5 years’ administrative experience and leadership qualities to head an Institute of National 


importance. The candidate/person should be a Ph.D. with first class or equivalent at the preceding degree, 
preferably in a branch of Engineering. In exceptional cases, candidates with Science, Mathematics or 
Management degrees may be considered. He/she should have an outstanding academic record throughout and a 
minimum of 10 years teaching experience as a Professor in a reputed Engineering or Technology Institute or 
University and should have guided Ph.D. students. The applicant should preferably be less than 60 years of age 
on the last date of receipt of the applications. The post carries a fixed pay of Rs. 2,25,000/- (Revised) per month, 
with allowances as per rules. 

2. Interested individuals may apply giving their detailed resume in the prescribed format clearly bringing out 
research, teaching, industry-academia collaborations and administrative achievements, alongwith a two-page 
justification in support of their candidature, a two-page vision statement for the institution and contact details of at 
least two distinguished individuals well acquainted with their work. The application typed in the prescribed format 
along with enclosures may be sent by Registered/Speed Post to The Under Secretary (TS.1), Department of 
Higher Education, Ministry of Human Resource Development, Room No. 428 "C" Wing, Shastri Bhawan, 
New Delhi -110 001 so as to reach the Ministry on or before 20" July, 2019. The detailed advertisement and 
the format of application is available on the website (www.mhrd.gov.in). 
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WORKING LIFE 


By Kate Bredbenner 


One Ph.D., hold the pastries 


never thought I would spend so much of my time and money setting up still-life-worthy displays 

of flaky croissants and shiny fruit for people who are judging my science, and that of my col- 

leagues. Yet that’s the expectation: At my university, and many others, students bring food to our 

thesis committee meetings and defenses, adding to the already sky-high pressure. My first taste 

of it came 5 years ago, for my first committee meeting. I prepared furiously. I meticulously proof- 

read my written proposal and aligned all the figures. My slides all used the same font. I had even 
prepared some extra slides to address possible questions my judges might ask. Even so, I was sure the 
meeting was doomed—because I didn’t know how to make coffee. 
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In my mind, one drop of burnt cof- 
fee would cause my judges to kick 
me aside like a stray dog. One bite 
of stale pastry would put them in a 
mood so foul I would be carted back 
to the humble Pennsylvania town 
where I grew up, then forced into 
my high school job as a Wendy’s 
drive-thru operator. Of course, I 
knew logically that these things 
were unlikely to happen, but the 
pressure of a committee meeting 
often sends logic out the window. 

So in the hours before the meet- 
ing, when I should have been look- 
ing over my slides, skimming key 
papers, or even relaxing a bit, I was 
standing numbly in front of the 
lab coffee maker—a Rubik’s Cube I 
couldn’t solve. Discouraged, I tack- 
led a problem more within my reach: 
plating the fresh almond crois- 
sants I had trekked 20 blocks uptown to fetch earlier that 
morning. (I had made the mistake of wearing my meeting 
outfit—a button-down that made me feel like a capable 
adult—and the humidity had left me with pit stains like a 
character in a deodorant commercial; good thing I had a 
backup outfit.) 

But once the croissants were settled on the wooden cut- 
ting board and the perfectly ripe grapes were in their cute 
owl-decorated bowl, I turned back to the dreaded coffee 
maker and felt my perfectly prepared world crumbling 
around me. Out of the corner of my eye, I saw Joan. One 
year ahead of me in grad school, Joan had already been 
through the gauntlet I was about to face. Even more impor- 
tant, she was a fervent coffee drinker. 

“Hey Joan!” I yelped, trying to keep my cool. “How would 
you like to help me make coffee for my meeting?” She gave me 
a knowing look, the kind of look that says she’s been there and 
come out the other side. Of course, she agreed to help. 


“Committees shouldn't 
expect students to provide 
lavish spreads.” 


In the end, the meeting went 
fine. But my lack of coffee-making 
knowledge shook my confidence in 
my abilities as a scientist more than 
it should have. 

Since then, the grad students in 
my lab have made a pact to help 
each other with snacks for these 
meetings. This year, we banded 
together to create a delectable of- 
fering for Joan’s thesis defense. 
She had been through all our com- 
mittee meetings with us, and if we 
could help her with some sand- 
wiches and a fruit salad, we were 
going to do it. She didn’t need the 
stress of remembering when to hit 
the button on the coffee maker 
while also remembering the mo- 
lecular weights of all the proteins 
in the nuclear pore complex. 

Our camaraderie doesn’t make 
the expectation less of a burden, though. Graduate stu- 
dents want to do everything in our power to please our 
committee members as we describe our research plans 
and show we are capable scientists. But that should mean 
spending time preparing and thinking, not stressing our- 
selves out to buy and serve croissants. I’m lucky to have 
some of the best fellow grad students-slash-event planners 
anyone could ask for, but many students don’t have this 
sort of network or tradition. 

The solution is easy: Committees shouldn’t expect stu- 
dents to provide lavish spreads, or anything at all. We 
shouldn’t have to spend our money buying overpriced fruit 
salad or know how to make coffee to be considered success- 
ful graduate students. Our research should be enough. 


Kate Bredbenner is a graduate fellow at Rockefeller 
University in New York City. Send your career story 
to SciCareerEditor @aaas.org. 
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